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METHODS FOR REFOLDING PROTEINS CONTAINING 
FREE CYSTEINE RESIDUES 

Field of the Invention 

Tiie present invention relates generally to methods of making proteins and more specifically to 
recombinant proteins containing at least . one "free" cysteine residue, i.e., a cysteine residue that does not 
participate in a disulfide bond. 

Background of the Invention 

Protem therapeutics generally must be administered to patients by injection. Most protein 
therapeutics are cleared rapidly from the body, necessitatmg frequent, often daily, injections. There is 
considerable interest in the development of methods to prolong the circulating half-lives of protein 
therapeutics in the body so that the proteins do not have to be injected frequently. Covalent modification of 
proteins with polyethylene glycol (PEG) has proven to be a usefiil method to extend the circulating half- 
lives of proteins in the body (Abuchowski et al., 1984; Hershfield, 1987; Meyers et al., 1991). Covalent 
attachment of PEG to a proteui increases the protem*s elBTective size and reduces its rate of clearance from 
the body. PEGs are commercially available in several sizes, allowing the circulating half-lives of PEG- 
modified proteins to be tailored for individual indications through use of different size PEGs. Other 
documented in vivo benefits of PEG modification are an increase in protein solubility and stability, and a 
decrease in protein imraunogenicity {Katre et al., 1987; Katre, 1990). 

One known method for PEGylating proteins covalently attaches PEG to cysteme residues using 
cysteine-reactive PEGs. A number of highly specific, cysteme-reactive PEGs with different reactive groups 
(e.g., maleimide, vinylsulfone) and different size PEGs (2-40 kDa, single or branched chain) are 
commercially available. At neutral pH, these PEG reagents selectively attach to "free" cysteine residues, 
i.e., cysteine residues not involved in disulfide bonds. Cysteine residues in most proteins participate in 
disulfide bonds and are not available for PEGylation using cysteine-reactive PEGs. Through in vitro 
mutagenesis using recombinant DNA techniques, additional cysteine residues can be introduced anywhere 
into the protein. The newly added "free" or "non-natural" cysteines can serve as sites for the specific 
attachment of a PEG molecule using cysteine-reactive PEGs. The added "free" or **non-natural" cysteine 
residue can be a substitution for an existing amino acid m a protein^ added preceding the amino-terminus of 
the mature protein or after the carboxy-tecminus of the mature protein, or inserted between two normally 
adjac^t anuno acids in tiie protein. Alternatively, one of two cysteines involved in a native disulfide bond 
may be deleted or substituted with another ansino acid, leaving a native cysteine (the cysteine residue in tibe 
protein that normally would form a disulfide bond with the deleted or substituted cysteine residue) free and 
available for chemical modification. Preferably the amino acid substituted for the cysteine would be a 
neutral amino acid such as serine or alanine. For example, human growtii hormone (hGH) has two disulfide 
bonds that can be reduced and allQrlated with iodoacetamide without impaujng4)ioiogicaI activity O^wley 
et al., (1969). Each of die four cysteines would-be reasonable targets for deletion or substitution by another 
amino acid. 
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Several naturally occurring proteins are known to contain one or more '%ee'' cysteine residues. 
Examples of such naturally occurring proteins include human Interleukin (IL)-2 (Wang et al., 1984), beta 
interferon (Mark et al., 1984; 1985), G-CSF (Lu et al., 1989) and basic fibroblast growth factor (bFGF, 
Thompson, 1992), IL-2, Granulocyte Colony-Stimulating Factor (G-CSF) and beta interferon (IFN-P) 
5 contain an odd niunber of cysteine residues, whereas basic fibroblast growth factor contams an even number 
of cysteine residues. 

Expression of recombinant proteins containmg firee cysteine residues has been problematic due to 
reactivity of the free sulfhydryl at physiological conditions. Several recombinant proteins contaiinng free 
cysteines have been expressed cytoplasmically, i.e., as intracellular proteins, in bacteria such as E, colu 

10 Examples include natural proteins such as IL-2, beta interferon, G-CSF, and engineered cysteine muteins of 
IL-2 (Goodson and Katre, 1990), IL-3 (Shaw et al., 1992), Tumor Necrosis Factor Binding Protein (Tuma et 
al., 1995), Insulin-like Growth Factor-I (IGF-I, Cox and McDermott, 1994), Insulin-like Growth Factor 
binding protein-l (IGFBP-1, Van Den Berg et al., 1997) and protease nexin and related proteins (Braxton, 
1998). All of these proteins were predominantly insoluble when expressed intracellularly in E. coli. The 

15 insoluble proteins were largely inactive and needed to be refolded in order to regain significant biological 
activity. In some cases the reducing agent dithiothreitol (DTT) was used to aid solubilization and/or 
refolding of the insoluble proteins. Purified, refolded IL-2, G-CSF and beta interferon proteins are unstable 
and lose activity at physiological pH, apparently due to disulfide rearangements involving the free cysteine 
residue (Wang et al., 1984; Mark et al., 1984; 1985; Oh-eda et al., 1990; Arakawa et al., 1992). 

20 Replacement of the free cysteine residue in these proteins with serine, resulted in a protein that was more 
stable at physiological pH (Wang et aL, 1984; Mark et al., 1984; 1985; Arakawa et al., 1993). 

A second known method for expressmg recombmant proteins in bacteria is to secrete them into the 
periplasmic space or into the media. It is known that certain recombinant proteins such as GH are e}q)ressed 
in a soluble active form when they are secreted into the E. coli periplasm, whereas they are insoluble v^en 

25 expressed intracellularly in E, coli. Secretion is achieved by fiising DNA sequences encoding GH or other 
proteins of interest to DNA sequences encoduag bacterial signal sequences such as those derived from the 
stn (Fujimoto et al., 1988) and ompA proteins (Ghrayeb et al., 1984). Secretion of recombinant protems in 
bacteria is desirable because the natural N-terminus of the recombinant protein can foe maintained. 
Intracellular expression of recombinant proteins requires that an N-terrainal methionine be present at the 

30 amino-terminus of the recombinant protein. Methionine is not normally present at the amino-terminus of the 
mature forms of many human proteins. For example, the amino-terminal amino acid of the mature form of 
human GH is phenylalanine. An amino-terminal methionine must be added to the amino-terminus of a 
recombinant protein, if a methionine is not present at this position, in order for the protein to be expressed 
efficiently m bacteria. Typically addition of the amino-terminal methionine is accomplished by adding an 

35 ATG methionine codon preceding the DNA sequence encoding the recombinant protein. The added N- 
terminal methionine often is not removed from the recombinant protein, particularly if the recombinant 
protein is insoluble. Such is the case with hGH, where the N-terminal methionine is not removed when fte 
protein is expressed mtracellularly in £. coll The added N-t^minal methionine creates a ^'non-natural" 
INTOtein that potentially can stimulate an immime response in a human, fri contrast, there is no added 
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methionine on hGH that is secreted into the periplasmic space using stil (Chang et al., 1987) or ompA 
(Cheah et al., 1994) signal sequences; fee recombinant protem begins with the native amino-terminal amino 
acid phenylalanine. The native hGH protein sequence is maintained because bacterial enzymes cleave the 
stn-hGH protein (or ompA-hGH protein) between the sttl (or on^ A) signal sequence and the start of the 

5 mature hGH protein. 

hGH has four cysteines that form two disulfides. hGH can be seoreted into the E. coli periplasm 
using stn or ompA signal sequences. The secreted protein is soluble and biologically active (Hsiung et al., 
1986). The predominant secreted form of hGH is a monomer with an apparent molecular weight by sodium 
dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE) of 22 kDa. Recombinant hGH can be 

10 isolated from the periplasmic space by using an osmotic shock procedure (Koshland and Botstein, 1980), 
which preferentially releases periplasmic, but not intracellular, proteins into the osmotic shock buffer. The 
released hGH protein is then purified by column chromatography (Hsiung et al., 1986). A large number of 
GH mutants have been secreted into the E. coli periplasm The secreted mutant proteins were soluble and 
could be purified using procedures similar to those used to purify wild type GH (Cunningham and Wells, 

15 1989; Fuh et aL, 1992). Unexpectedly, when similar procedures were used to secrete GH variants 
containing a free cysteine residue (five cysteines; 2N+1), it was discovered that certain recombinant GH 
variants were insoluble or formed multimers or aggregates when isolated using standard osmotic shock and 
purification procedures developed for GH. Very little of the monomeric GH variant proteins could be 
detected by non-reduced SDS-PAGE in the osmotic shock lysates. Insoluble or aggregated GH variants 

20 have reduced biological activities compared to soluble, properly folded hGH. Methods for refolding 
msoluble, secreted Growth Hormone variants contammg a free cysteine residue into a biologically active 
form have not been described. 

Alpha interferon (IFN-a2) also contains four cysteine residues that form two disulfide bonds. IFN- 
a2 can be secreted into the E. coli periplasm using the sfll signal sequence (Voss et al., 1994). A portion of 

25 the secreted protein is soluble and biologically active (Voss et al., 1994). Secreted, soluble recombinant 
IFN-a2 can be purified by colunm chromatography (Voss et al., 1994). When shnilar procedures were 
attempted to secrete IFN-c^ variants containing a free cysteme residue (five cysteines; 2N+1), it was 
discovered that certain of the recombmant IFN-ct2 variants were predominantly insoluble or formed 
multuners or aggregates \rfien isolated using standard purification procedures developed for IFN-a2. 

30 Insoluble or aggregated IFN-a2 variants have reduced biological activities con?)ared to soluble, properly 
folded IFN-a2. Methods for refolding msoluble, secreted IFN-a2 variants containing a free cysteine 
residue into a biologically active form have not been described. 

Human Granulocyte Colony-Stimulating Factor (G-CSF) contains five cysteme residues that form 
two disulfide bonds. The cysteme residue at position 17 m the mature protem sequence is free. Perez-P^rez 

35 et al. (1995) reported that G-CSF could be secreted into the E. coli periplasm using a variant form of the 
ompA signal sequence. However, yery little of the ompA-G-CSF fusion protein was correctly processed to 
yield mature G-CSF. The percentage of correctly processed G-CSF could be improved by co-expressing the 
E. coli dnaK and dnaJ proteins in the host eells expressing the ompA-G-CSF fusion protem (Perez-Perez et 
al., 1995). Correctly processed, secreted G-CSF was lar^ly insoluble in all E, <:oli stramsexammed (Perez- 
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Perez et al., 1995). Insoluble G-CSF possesses reduced biological activity compared to soluble, properly 
folded G-CSF. When similar procedures were atten5)ted to secrete wild type G-CSF, G-CSF variants in 
which the free cysteine residue was replaced with serine [G-CSF (C17S)], and G-CSF (C17S) variants 
containing a free cysteine residue (five cysteines; 2N+1) using the stn signal sequence, it was discovered 

5 that the recombinant G-CSF proteins also were predominantly insoluble. Methods for refolding msoluble, 
secreted G-CSF proteins into a biologically active form have not been described. 

Human Granulocyte Macrophage Colony-Stimulating Factor (GM-CSF) contains four cysteine 
residues that fonn two disulfide bonds. Libbey et al. (1987) and Greenberg et al. (1988) reported that GM- 
CSF could be secreted into the E. coli periplasm using the ompA signal sequence. Correctly processed, 

10 secreted GM-CSF was insoluble ( Libbey et al., 1987; Greenberg et al., 1988). Insoluble GM-CSF 
possesses reduced biological activity compared to soluble, properly folded GM-CSF. When similar 
procedures were attempted to secrete GM-CSF variants contauiing a free cysteine residue (five cysteines; 
2N+1) using the stil signal sequence, it was discovered that the recombinant GM-CSF proteins also were 
predominantly insoluble. Methods for refolding insoluble, secreted GM-CSF proteins into a biologically 

1 5 active form have not been described. 

US Patent No. 5,206,344 and Goodson and Katre (1990) describe expression and purification of a 
cysteine substitution mutein of IL-2. The IL-2 cysteine mutein was insoluble when expressed intracellularly 
in E. colL The protein was solubilized by treatment with a denaturing agent [either 10% sodium dodecyl 
sulfate (SDS) or 8M urea] and a reducing agent [100 mM dithiothreitol (DTT)], refolded and purified by 

20 size-exclusion chromatography and reversed phase HPLC. Expression and purification of cysteine muteins 
of IL-3 are described in US Patent No. 5,166,322. The IL-3 cysteine mutems also were insoluble when 
expressed intracellularly in E. coli. The protems were solublilized with a denaturing agent (guanidine) and a 
reducing agent (DTT), refolded and purified by reversed phase HPLC. The purified IL-3 cysteme muteins 
were kept m a partially reduced state by inclusion of DTT in the storage buffers. When the inventors used 

25 only a denaturing agent agent and a reducing agent (DTT) to denature and refold insoluble cysteine niuteins 
of GH and G-CSF, it was discovered that the refolded proteins were heterogeneous, coniprising multiple 
molecular weight species. Similarly, when the inventors denatured and refolded insoluble, secreted IFN-a2 
cysteine muteins with only a denaturing agent and a reducing agent (DTT), undetectable levels of properly 
folded IFN-a2 cysteine muteins were obtained. 

30 Malik et al. (1992) and Knusli et al. (1992) described conjugation of wild tpe GM-CSF with 

amine-reactive PEG reagents. The amine-PEGylated GM-CSF comprised a heterogeneous mixture of 
different molecular weight PEG-GM-CSF species modified at multiple amino acid residues (Malik et al. 
1992; Knusli et al. , 1992). The various amine-PEGylated GM-CSF species could not be purified from each 
other or from non-PEGylated GM-CSF by conventional chromatography methods, which prevented specific 

35 activity measurements of the various isoforms from being determined. Clark et al. (1996) described 
conjugation of GH with amine-reactive PEGs. Amine-PEGylyated GH also was heterogeneous, comprising 
a mixture of mutiple molecular weight species modified at multiple amino acid residues. The amine- 
PEGylated GH proteins displayed significantly reduced biological activity (Clark et al., 1996). Monkarsh et 
al. (1997) described amine-PEGylated alpha interferon, which also comixised multiple molecular wei^t 
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species modified at different amino acid residues. Amine-PEGylated alpha interferon also displayed 
reduced biological activity. Tanaka et al. (1991) described amine-PEGylated G-CSF, which also comprised 
a heterogeneous mixture of different molecular weight species modified at different amino acid residues. 
Amine-PEGylated G-CSF displayed reduced biological activity (Tanaka et al., 1991). Kinstler et al. (1996) 
5 described a PEGylated G-CSF protein that is preferentially modifed at the non-natural N-terminal 
methionine residue. This protein also displayed reduced biological activity (Kinstler et al. 1996). 

Therefore, despite considerable effort, a need still exists for methods that allow an insoluble or 
aggregated protein containing one or more free cysteine residues to be refolded into a soluble, biologically 
active form in high yield. The present invention satisfies this need and provides related advantages as well. 
10 Similarly, a need also exists for methods of generating homogeneous preparations of long acting 
recombinant proteins by enhancement of protein molecular wei^t, such as by PEGylation. 

Summary of the Invention 

The present invention generally relates to methods for obtaining refolded, soluble forms of protems 

15 having one or more free cysteine residues and which are expressed by a host cell in an insoluble or 
aggregated form. Such proteins include, but are not limited to, members of the Growth Hormone supergene 
family, such as GH, IFN-a2, G-CSF and GM-CSF proteins, and anti-angiogenesis factors, such as 
endostatin and angiostatin. The methods are generally accomplished by (a) causing a host cell to express a 
protein containing a free cysteine residue in an insoluble or aggregated form; (b) lysing the cell; (c) 

20 solubilizing the insoluble or aggregated protein in the presence of a denaturing agent, a reducmg agent and a 
cysteine blocking agent; and (d) refolding the protein by lowering the concentrations of the denaturing 
agent and reducing agents to levels sufficient to allow the protein to renature to a biologically active form. 
Optionally, the soluble, refolded protem is isolated from other proteins hi the refold mixture. 

Suitable host cells uiclude bacteria, yeast, insect or mammalian cells. Preferably, the host cell is a 

25 bacterial cell, particularly E.colL 

Preferably, the soluble, refolded proteins produced by the methods of the present invention are 
recombinant proteins, especially cysteine variants or cysteine muteins of a protein. As used herein, the 
terms "cysteine varianf and "cysteine mutem" are meant to encompass any of the following changes in a 
protein's amino acid sequence: addition of a non-natural cysteine residue preceding the amino terminus of 

30 the mature protein or followmg the carboxy-terminus of the mature protein; substitution of a non-natural 
cysteine residue for an existing amino acid in the protein; introduction of a non-natural cysteine residue 
between two normally adjacent amino acids in the protein; or substitution of another amino acid for a 
naturally occurring cysteine residue that normally form a disulfide bond in the protein. The methods are 
useful for producing proteins including, without limitation, GH, G-CSF, GM-CSF and interferon, especially 

35 alpha interferon, cysteine variants of these proteins, their derivatives or antagonists. Other proteins for 
which the methods are useful include other members of the GH supergene family, the Transforming Growth 
Factor (TGF)-beta superfamily, platelet derived growth factor-A, platelet derived growth &ctor-B, nerve 
growth fector, brain derived neurotophic factor, neurotrophin-3, neuroti*ophin-4, vascular endothelial growth 
factor, chemokines, hormones, endostatin, angiostatin, cysteine muteins of these {Hoteins, or a derivative or 
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an antagonist thereof. Cysteine muteins of heavy or light chains of an immunoglobulin or a derivative 
thereof are also contemplated. 

As used herein, the term "cysteine blocking agenf means any reagent or combination of reagents 
that result in the formation of a reversibly blocked fiee cysteine residue in a protein. Examples of useful 
5 cysteine blocking agents include, but are not limited to, dithiols such as cystine, cystamine, oxidized 
glutathione, dithioglycolic acid and the like, or thiols such as cysteine, cysteamine, thioglycolic acid, and 
reduced glutathione. Preferably, thiols should be used in the presence of an oxidizing agent. Useful 
oxidizing agents include oxygen, iodine, ferricyanide, hydrogen peroxide, dihydroascorbic acid, 
tetrathionate, and O-iodosobenzoate. Optionally, a metal ion such as copper (Cu**) or cobalt (Co"*^ can be 

10 added to catalyze the oxidation reaction. Although not wishing to be bound by any particular theory, the 
inventors postulate that the cysteine blocking agent forms a mixed disulfide with the free cysteine residue in 
the protein, thus limiting possible disulfide rearrangments that could occur involving the free cysteine 
residue. The mixed disulfide stabilizes the free cysteine residue, significantly enhancing the yield of 
properly folded, biologically active, soluble protein. As used herein, reducing agents such as DTT and 2- 

15 mercaptoethanol are not considered cysteine blocking agents because they do not result in the formation of a 
reversibly blocked mixed disulfide with the free cysteine residue in the protein. DTT typically does not 
form mixed disulfides with cysteine residues in proteins due to a thermodynamically preferred 
intramolecular bond that forms upon oxidatioa 

Higher order dimeric and multimeric proteins formed by the covalent association of two or more of 

20 the refolded proteins via dieir free cysteine residues also within the present invention. 

The present methods further include various methods of attaching a cysteineH:eactive moiety to the 
refolded protein to form modified protein in which the cysteine-reactive moiety is attached to the refolded 
protein through the free cysteine residue(s). An example of a usefiil cysteine-reactive moiety that can be 
attached to the refolded protein is a cysteine-reactive PEG, vHnch can be used to form a PEGylated protein. 

25 Such methods include (a) isolating the refolded protein having a free cysteine residue from other proteins m 
the refold mixture; (b) reducing, at least partially, the isolated, refolded protein mfb a disulfide-reducing 
agent and (c) exposing the protein to a cysteine-reactive moiety such as a cysteine-reactive PEG. 
Optionally, the modified protein can be isolated from unmodified protein. Examples of other useful 
cysteine-reactive moieties are cysteine-reactive dextrans, cysteine-reactive carbohydrates, cysteine-reactive 

30 poly (N-vinylpyrroUdone)s, cysteine-reactive peptides, cysteine-reactve lipids, and cysteine-reactive 
polysaccharides. 

The present invention further includes the soluble, refolded proteins and their derivatives, mcluding 
PEGylated proteins, made by the methods disclosed herein. Such PEGylated proteins include 
monopegylated, cysteine variants of GH, G-CSF, GM-CSF and alpha interferon proteins. Such PEGylated 
35 proteins also include cysteine variants of GH, G-CSF, GM-CSF and alpha interferon proteins modified with 
two or more PEG molecules, where at least one of the PEG molecules is attached to the protein through a 
free cysteine residue. 
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Detailed Description of the Invention 

The present invention provides novel methods of preparing refolded, soluble forms of GH, IFN-a2, 
G-CSF and GM-CSF proteins that have at least one free cysteine residue and which are expressed by a host 
cell in an insoluble or aggregated form. The present invention can be used to prepare refolded, soluble 
forms of other members of the GH supergene fmily that have at least one fiee qrsteine residue and which 
are expressed by a host cell in an insoluble or aggregated form. The present invention also can be used to 
prepare refolded, soluble fonns of other types of proteins having at least one free cysteine residue and which 
are expressed by a host cell in an insoluble or aggregated form, including, but not limited to, anti* 
angiogenesis proteins such as endostatin and angiostatin. The invention further provides novel proteins, 
particularly recombinant proteins produced by these novel methods as well as derivatives of such 
recombmant proteins. The novel methods for preparing such proteins are generally accomplished by: 

(a) causing a host cell to express a protein having a free cysteine in an insoluble or aggregated 
form; 

(b) lysing the host cell by chemical, enzymatic or physical means; 

(c) solubilizing the insoluble or aggregated protein by exposing the protein to a denaturing 
agent, a reducing agent and a cysteine blocking agent; and 

(d) refolding the protein by reducing the concentrations of the denaturing agent and reducing 
agent in the solubilization mixture to levels sufScient to allow the protem to renature into 
a soluble, biologically active form. 

Optionally, the refolded, soluble protein can be isolated from other proteins in the refold mixture. The 
methods and other embodiments of the present invention were described in detail in U.S. Provisional 
Application Serial No. 60/204,617, filed May 16, 2000. U.S. Provisional Application Serial No. 
60/204,617 is incorporated herein by reference in its entirety. 

As identified above, the first step in these methods is to cause a host cell to express a protein 
having a firee cysteine residue in an insoluble or aggregated form. Suitable host cells can be prokaryotic or 
eukaiyotic. Exanq)les of appropriate host cells that can be used to e^tpress recombinant protems include 
bacteria, yeast, insect and mammalian cells. Bacteria cells are particularly usefril, especially ExolL 
Methods of causing a host cell to express a protein are well known in the art and examples are provided 
herein. 

As used herein, the term "protein having a free cysteine residue" means any natural or recombinant 
protein or peptide that contains 2N+1 cysteine residues, where N can be 0 or any integer, and any natural or 
recombinant protein or peptide that contain 2N cysteines, where two or more of the cysteines do not 
normally participate in a disulfide bond. Thus, the methods of the present invention are useful in enhancing 
the expression, recovery and purification of any protein or peptide having a free cysteine, particularly 
cysteine added variant recombinant proteins (referred to herein as "cysteine muteins" or "cysteine variants") 
having one or more free cysteines. Although the expression, recovery and purification of a natural protein 
having a fiiee cysteine expressed by its natural host cell can be enhanced by the methods of the present 
invention, die description herein predominantly refers to recombinant protems for illustrative purposes only. 
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Jn addition, the proteins can be derived from any animal species including human, companion animals and 
&rm animals. The proteins also can be derived from plant species or microbes. 

Accordingly, the present invention encompasses a wide variety of recombmant proteins, and 
cysteine variants of these proteins. These protems include members of the GH supergene family, aiid 
5 cysteme variants of these proteins. The following proteins ("collectively referred to as the GH siq)ergene 
family**) are encoded by genes of the GH supergene family (Bazan (1990;1991; 1992); Mott and Campbell 
(1995); Silvennoinen and Ihle (1996); Martin et al. (1990); Hannum et al. (1994); Blumberg et al., 2001): 
GH, prolactin, placental lactogen, erythropoietin (EPO), thrombopoietin (TPO), interleukin-2 (IL-2), IL-3, 
IL^, IL-5, IL-6, IL-7, IL-9, IL-10, IL-11, IL-12 (p35 subunit), IL-13, IL-15, IL-19, IL-20, IL-TIF, MDA-7, 

10 AK-155, oncostatin M, ciliary neurotrophic factor, leukemia inhibitory factor, alpha interferon, beta 
interferon, gamma interferon, omega interferon, tau interferon, granulocyte-colony stimulating factor (G- 
CSF), granulocyte-macrophage colony stimulating factor (GM-CSF), macrophage colony stimulating factor 
(M-CSF), cardiotrophin-l (CT-1), Stem Cell Factor and the flt3/flk2 ligand. It is anticipated that additional 
members of the GH supergene family will be identified in the future through gene cloning and sequencing. 

15 Members of the GH supergene family have similar secondary and tertiary structures, despite the fact that 
they generally have limited amino acid or DNA sequence identity. The shared structural features of 
members of the GH supergene family, which are described in Bazan (1990; 1991; 1992), Mott and 
Campbell (1995) and Silvennoinen and Ihle (1996), allow new members of the gene family to be readily 
identified. Variants of tiiese proteins such as the selective IL-2 antagonist described by Shanafelt et al. 

20 (2000) also are encompassed by this invention 

The present methods also can enhance the expression, recovery and purification of additional 
recombinant proteins, mcluding members of the TGF-beta superfamily. Members of the TGF-beta 
superfamily include, but are not limited to, glial-derived neurotrophic fiictor (GDNF), transforming growth 
fector-betal (TGF-betal), TGF-beta2, TGF-beta3, inhibin A, inhibin B, bone morphogenetic protein-2 

25 (BMP-2), BMP-4, inhibin alpha, Mullerian inhibiting substance (MIS), and OP-1 (osteogenic protein 1). 
The monomer subunits of the TGF-beta superfemily share certain structural features that allow other 
members of this family to be readily identified: they generally contam 8 higjily conserved cysteine residues 
that foim 4 intramolecular disulfides. Typically a ninth conserved cysteine is fi-ee in the monomeric form of 
the protein but participates in an intermolecular disulfide bond formed during the homodimerization or 

30 heterodimerication of the monomer subunits. Other members of the TGF-beta superfamily are described by 
Massague (1990), Daopin et al. (1992), Kingsley (1994), Kutty et al. (1998), and Lawton et al. (1997), 
incorporated herein by reference. 

Immunoglobulin (Ig) heavy and light chain monomers also contain cysteine residues that 
participate in intramolecular disulfides as well as free cysteines (Roitt et al, 1989 and Paul, 1989). These 

35 firee cysteines normally only participate in disulfide bonds as a consequence of multimerization events such 
as heavy chain homodimerization, heavy chain - hght chain heterodimerization, homodimerization of the 
(heavy chain - light chain) heterodimers, and other higher order assemblies such as pentamerization of the 
(heavy chain - light chain) heterodimers in the case of IgM. Thus, flie methods of the present invention can 
be employed to «idiance the expression, recovery and purification of heavy and/or ligjit chains (or various 



SUBSTITUTE SHEET (RULE 26) 



wo 01/87925 



PCT/USOl/16088 



domains thereof) of human inununoglobulins such as for example IgGl, IgG2, IgG3, IgG4, IgM IgAl, 
IgA2, secretory IgA, IgD and IgE» and cysteine variants of these proteins or fragments thereof. 
Immunoglobulins from other q)ecies could also be similarly expressed, recovered and purified. Proteins 
genetically fused to immunoglobulins or immunoglobulin domains, as described in Chamow & Asbkenazi 
5 (1996), could also be sunilarly expressed, recovered and purified. 

A group of proteins has been classed as a structural superfamily based on the shared structural 
motif termed the "cystine knot". The cystine knot is defmed by six conserved cysteine residues that form 
three intramolecular disulfide bonds that are topologically "knotted" (McDonald and Hendrickson, 1993). 
These proteins also form homo- or heterodimers and in some but not all instances dimerization involves 

10 intermolecular disulfide formation. Members of this family include the members of the TGF-beta 
superfamily and other proteins such as platelet derived growth factor-A (PDGF-A), PDGF-B, nerve growth 
factor (NOP), brain derived neurotrophic factor (BDNF), neurotrophin-3 (NT-3), NT-4, and vascular 
endothelial growth factor (VEGF), Cysteine blocking reagents also could enhance expression, recovery and 
purification of proteins with this structural motif, and cysteine-added variants of these proteins. 

15 The present methods also can enhance the expression, recovery and purification of other 

recombinant proteins and/or cysteine added variants of those proteins. Classes of proteins for which &e 
present methods would be useful include proteases and other enzymes, protease inhibitors, cytokines, 
c^kine antagonists, cytokine "selective agonists*', allergens, chemokines, gonadotrophins, chemotactins, 
lipid-bindmg proteins, pituitary hormones, growth &ctors, somatomedins, unmunoglobulins, interleukins, 

20 interferons, soluble receptors, extracellular domains of cell-surfrtce receptors, vaccines, single chain 
antibodies and hemoglobms. Specific examples of proteins include, for example, leptin, insulin, insulin- 
like growth &ctor I and II (IGF-I and IGF-U), superoxide dismutase, catalase, asparaginase, uricase, 
fibroblast growth &ctors, arginase, angiostatin, endostatin, Factor Vm, Factor DC, interleukin 1 receptor 
antagonist, parathyroid hormone, growth hormone releasing factor, calcitonin, extracellular domain of the 

25 VEGF receptor, protease nexin and auti-thrombin m. 

Other protein variants that would benefit tcom PEGylation and would therefore be reasonable 
candidates for cysteine added modifications include proteins or peptides with poor solubility or a tendency 
to aggregate, proteins or peptides that are susceptable to proteolysis, proteins or peptides needing improved 
mechanical stability, proteins or peptides that are cleared rapidly from the body, or proteins or peptides with 

30 undesirable immunogenic or antigentic properties. 

If desired, cysteine and other amino acid muteins of these proteins can be generally constructed 
using site-directed PCR-based mutagenesis as decribed in the Examples below and in PCTAJS98/14497 and 
PCT/USOO/0093, each of which is mcorporated by reference in its enturety. Methods for constructing 
muteins using PGR based PGR procedures also are described in general in Methods in Molecular Biology, 

35 Vol. 15: PCR Protocols: Current Methods and Applications edited by White, B. A. (1993) Humana Press, 
Inc., Totowa, NJ and PCR Protocols: A Guide to Methods and Applications edited by Innis, M. A. et al. 
(1 990) Academic Press, Inc. San Diego, CA. 

Melbods known in the art can be used to induce expression of a protein in the cyt(q>lasm or to 
direct secretion of the protein, depending on cell origm, including, for exanq)le, Ae methods described in 



SUBSTITUTE SHEET (RULE 26) 



wo 01/87925 



10 



PCT/USOl/16088 



the Examples below. A wide variety of signal peptides have been used successfully to transport proteins to 
the periplasniic space of E. coli. Examples of these include prokaryotic signal sequences such as ompA, 
stil, PhoA signal (Denefle et al., 1989), OmpT (Johnson et al., 1996), LamB and OmpF (Hoffman and 
Wright, 1985), beta-lactamase (Kadonaga ct al, 1984), enterotoxins LT-A, LT-B (Morioka-Fujimoto et al., 
5 1991), and protein A from S. aureus (Abrahmsen et al., 1986). A number of non-natural, synthetic, signal 
sequences that facilitate secretion of certain proteins are also known to those skilled in the art. 

Next, the host cell is lysed. Cell lysis can occur prior to, or coincident with, the solubilization 
procedures described below. Cell lysis can be accomplished by, for example, mechanical sheer such as a 
French pressure cell, enzymatic digestion, sonication, homogenization, glass bead vortexing, detergent 

10 treatment, organic solvents, freeze thaw, grinding with alumina or sand, treatment with a denaturing agent as 
defined below, and the like (Bollag et al., 1996). Optionally, the cells can be lysed in the presence of a 
denaturing agent, a disulfide reducing agent, or a cysteine-blocking agent. Optionally, insoluble or 
aggregated material can be separated from soluble proteins by various methods such as centrifugation, 
filtration (including ultrafiltration), precipitation, floculation, or settling. 

15 Next the insoluble or aggregated material (or whole cells without prior lysis) is rendered soluble or 

monomeric by exposing the insoluble or aggregated material (or whole cells without prior lysis) to a 
denaturing agent, and a disulfide reducing agent that also is a cysteine-blocking agent. Useful denaturing 
agents include urea, guandine, arginine, sodium thiocyanate, extremes in pH (dilute acids or bases), 
detergents (SDS, sarkosyl ), salts (chlorides, nitrates, thiocyanates, cetylmethylanunonium salts, 

20 trichloroacetateS) , chemical derivatization (sulfitolysis, reaction with citraconic anhydride), solvents (2- 
amino-2-methyl-l-propanol or other alcohols, DMSO, DMF) or strong anion exchange resins such as Q- 
Sepharose. Useful concentrations of urea are 1-8 M, with 5-8 M being preferred concentrations. Useful 
concentrations of guanidine are 1-8 M, with 4-8 M being prreferred concentrations. Useful disulfide 
reducing agents that also are cysteine blocking agents include, but are not limited to, thiols such as cysteine, 

25 thioglycolic acid, reduced glutathione and cysteamine. These compounds can be used in the range of 0.5 to 
200 mM, with 1-50 mM being preferred concentrations. Cysteine, reduced glutathionine, thioglycolic acid 
and cysteamine are preferred reducing agents because they also are cysteine blocking agents, i.e., they 
interact with the free cysteine residue in the protein to form a reversibly blocked free cysteine residue. Use 
of a disulfide-reducing agent that also is a cysteine blocking agent during the solubilization step reduces the 

30 number of compounds and steps required in the overall process for refolding the insoluble or aggregated 
protein to a soluble, active form. Furthermore, use of a cysteine blocking agent results in a form of the 
refolded protein that is suitable for derivatization at the free cysteine residue using variuos cysteine-reactive 
moieties and procedures described below. Preferably, the pH of the denaturation/reduction mixture is 
between pH 6 and pH 10. 

35 The next step in the procedure is to refold the protein to obtain the protein's native conformation 

and native disulfide bonds. Refolding is achieved by reducing the concentrations of the denaturing agent 
and reducing agent to levels sufficient to allow the protein to renature into a soluble, biologically active 
form This can be achieved through dialysis, dilution, gel fitration, precipitation of the protein, or by 
immobilization on a resin followed by buffer washes. Conditions for this step are chosen to allow for 
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regeneration of the protein's native disulfide bond(s). This can be accomplished tfarougji addition of an 
oxidizing agent, or a redox mixture of an oxidizing agent and a reducing agent, to catalyze a disulfide 
exchange reaction. Preferably, a reagent or combination ofreagents are chosen that result in native disulfide 
bond formation and a reversibly blocked free cysteine residue, Le., the reagent or combination of reagents 
5 acts as cysteine blocking agents. Examples of useful oxidizing reagents include oxygen, cystme, oxidized 
glutathione, cystanune, and dithioglycolic acid. Exan^les of useful redox mixtures include 
cysteine/oxygen, cysteme/cystme, cysteine/cystamine, cysteamine/cystamine, reduced glutatfaione/oxidized 
glutathione, and the like. Optionally, a reducing agent such as DTT or 2-mercaptoethanol can be added to 
the refold mixture to promote disulfide exchange. Optionally, a metal ion such as copper (Cu^ or cobalt 

10 (Co*^ can be added to the refold mixture to promote protein oxidation. Useful concentrations of metal ions 
in the refold mixture are 1 ^iM to 1 mM, with 40 pM bemg a preferred concentration. Preferably, the pH of 
the refold mixture is between pH 6 and pH 1 0. 

Alternatively, the insoluble or aggregated mat«ial (or whole cells without prior cell lysis) is 
rendered soluble or monomeric through the use of a denaturing agent and a disulfide reducing agent that 

15 may or may not be a cysteine blocking agent. Useful denaturing agents include, but are not limited to, those 
described above. Examples of useful disulfide reducing agents include, but are not limited to, DTT, 2- 
mercaptoethanol, sodixmi borohydride, tertiary phosphines and thiols such as cysteine, reduced 
glutathionine, thioglycolic acid and cysteamine. DTT and 2-mercaptoethanol can be used in the range of 0.5 
- 200 mM, with 1-50 mM being preferred concentrations. The denatured and reduced protein is then mked 

20 with a molar excess (relative to the concentration of the reducing agent) of a dithiol reagent that, when 
reduced, can act as a cysteine blocking agent. £xan:q)les of usefid dithiol reagents that can act as (^^ine 
blocking agents when reduced include compounds containing disulfide linkages such as cystine, cystamine, 
oxidized glutathione, dilfaioglycolic acid, 5,5'-difhiobis(2-nitrobenzoic acid (Ellman's reagent), pyridine 
disulfides, compoimds of the type R-S-S-C0-0CH3.where R is an organic compound , other derivatives of 

25 cystine such as diformylcystine, diacetylcystine, diglycylcystine, dialanylcystine diglutammylcystine, 
cystinyldiglycine, cystinyldiglutamine, dialanylcystine dianhydride, cystine phenylhydantoin, homocystine, 
dithiodipropionic acid , dimefhylcystine, or any dithiol or chemical capable of undergoing a disulfide 
exchange reaction. Refolding of the protein is initiated by lowering the concentration of the denaturing 
agent (using the methods described above) and promoting disulfide exchange by addition of a reducing 

30 agent such as cysteine, ditfaiothreitol, 2-mercaptoethanol, reduced glutathione, thioglycolic acid or other 
thiol. Preferrably, a reagent or combination of reagents are chosen that result in native disulfide bond 
formation and a reversibly blocked fi-ee cysteine residue. Optionally, a metal ion such as copper (Cu"*^ or 
cobalt (Co^, can be added to the refold mixture to promote protein oxidation. Optionally, glycerol can be 
added to the refold mixture to increase the yield of refolded protein. Useful concentrations of glycerol in the 

35 refold mixture are 1-50% (volume/volume), with 10-20% being a preferred range. Preferably, the pH of the 
refold mixture is 6- 1 0. 

Although not wishing to be bound by any particular theory, it is believed that the cysteme blocking 
agents used in the present metiiods coval^tiy attach to the 'tiee" cysteine residue, forming a mixed 
disulfide, tiius stabilizing the fi;ee cysteine residue and preventing multimerization and aggregation of the 
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protein. A number of thiol-reactive compounds can be used as cysteine blocking agents to stabilize proteins 
containing firee cysteines. In addition to cysteine, cysteamine, thioglycolic acid and reduced glutathionine, 
cysteine blocking agents can also include reagents containing disulfide linkages such as cystine, cystamine, 
dithioglycolic acid, oxidized glutathione, 5,5'-ditfaiobis(2-nitrobenzoic acid (£llman*s reagent), pyridine 
5 disulfides, compounds of the type R-S-S-CO-OCH3 , other derivatives of cystine such as diformylcystine, 
diacetylcystine, diglycylcystine, dialanylcystine diglutaminylcystine, cystinyldiglycine, cystinyldiglutamine, 
dialanylcystine dianhydride, cystine phenylhydantoin, homocystine, dithiodipropionic acid , 
dimethylcystine, or any dithiol or chemical capable of undergoing a disulfide exchange reaction. Sulfenyl 
halides can also be used to prepare mixed disulfides. Other thiol blocking agents that may find use in 

10 stabilizing proteins containing free cysteine residues include compounds that are able to reversibly react 
with free thiols. These agents include certain heavy metals salts or organic derivatives of zinc, mercury, and 
silver. Other mercaptide forming agents or reversible thiol reactive compounds are described by Cecil and 
McPhee(1959) and Torchinskii (1971). 

Optionally, the refolded, soluble protein containing a free cysteine residue is recovered and isolated 

15 from other proteins in the soluble fraction of the refold mixture. Such recovery and purification methods are 
known or readily determined by those skilled in the art, including, for example, centrifugation, filtration, 
dialysis, chromatography, including size exclusion, ion-exchange, hydrophobic interaction and afBnity 
chromatography procedures and the like. A suitable method for the recovery and purification of a desu^d 
protem will depend, m part, on the properties of the protein and the intended use. 

20 The present mvention also provides novel methods for producing biologically active G-CSF 

protems, particularly wild type G-CSF, G-CSF (C17S), and G-CSF and G-CSF (C17S) variants, mcludmg 
cysteine variants, (collectively referred to as '"G-CSF proteins'O^ that result in a significant mcrease in the 
percentage of the recovered G-CSF proteins that has been properly processed and is biologically active. 
These novel methods include secreting the G-CSF proteins uito the £. coll periplasm using tiie stU signal 

25 sequence, denaturing and refolding the insoluble or aggregated G-CSF iHX)teins, and purifying the soluble, 
refolded G-CSF proteins from other proteins in the soluble fraction of the renaturation/refold mixture. The 
recovered G-CSF proteins lack the non-natural N-terminal methionine residue present when G-CSF proteins 
are expressed intracellularly in E. coh. Published reports (Perez-Perez et al, 1995) describe secretion of G- 
CSF into the E. coli periplasm using a modified ompA leader sequence. However, very little of the 

30 expressed ompA-G-CSF fusion protein was properly processed to yield mature G-CSF. The percentage of 
properly processed G-CSF proteins could be increased to 10-30% of total expressed G-CSF proteins by co- 
expression of the E. coli dnaJ and dnaK proteins. In all cases, the secreted G-CSF proteins were largely 
insoluble and biologically inactive. The methods of the present invention yield at least 80-100% properly 
processed G-CSF proteins and do not require co-expression of the dnaK and dnaJ proteins. The present 

35 invention also provides, for the first time, methods for denaturing and refolding the msoluble, secreted G- 
CSF protems into a biologically active form. 

The purified proteins obtained according to these methods -can be further processed if desired. For 
exan^le, the isolated proteins can be modified at the free cysteine residue witii various cysteine-reactive 
moities. For example, the proteins can be PEGylated at the free cysteme residue with various cysteuie- 
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reactive PEG reagents, and subsequently purified as monoPEGylated proteins. The tenn "monoPEGylated" 
is defined to mean a protein modified by covaient attachment of a single PEG molecule to the protein. Any 
method known to those skilled in the art can be used to purify the PEGylated protein fix>m unmodified 
protein and unreacted PEG reagents, including, for example, the methods described in the Exanqples below, 
5 and in PCT/US98/14497 and PCT/USOO/00931. Examples of other useful cysteine-reactive moieties are 
cysteine-reactive dextrans, cysteine-reactive carbohydrates and cysteine-reactive poly (N- 
vinyIpyrrolidone)s. 

The present invention also provides methods for PEGylating cysteine muteins of GH, G-CSF, GM- 
CSF, alpha interferon and other proteins containing 2N + 1 cysteine residues, and other proteins containing 

10 2N cysteine residues where two or more of the cysteine residues are free, particularly those muteins and 
proteins in which the free cysteine residue is blocked by a mixed disulfide. 

The present invention further relates to purified, monoPEGylated protein variants produced by the 
methods disclosed herein that are not only biologically active, but also retain high specific activity in 
protein-dependent mammalian cell proliferation assays. Such protein variants include, for example, 

15 purified, monoPEGylated cysteine muteins of G-CSF, GH, GM-CSF and IFN-a2. For example, the in vitro 
biological activities of certain of the monoPEGylated G-CSF variants described herein are 3- to 50-fold 
greater than the biological activity of G-CSF that has been PEGylated using amine-reactive NHS-PEG 
reagents. 

There are over 25 distinct IFN-a genes (Pestka et al., 1987). Members of the IFN-a femily share 

20 varying degrees of amino acid homology and exhibit overlapping sets of biological activities. Non-natural 
recombinant IFN-ocs, created through joining together regions of different iPN-a proteins are in various 
stages of clinical development (Horisberger and DMarco, 1995). A non-natural "consensus" interferon 
(Blatt et al., 1996), which incorporates the most common amino acid at each position of IFN-a, also has 
been described. The methods of the present invention also are useful for refolding otiier alpha interf^on 

25 species and non-natural alpha interferon proteins containing a fi:ee cysteine residue. Useful sites and regions 
for PEGylating cysteine muteins of IFN-a2 are directly applicable to other members of the IFN-a gene 
family and to non-natural IFN-as. Kinstler et al. (1996) described monoPEGylated consensus interferon in 
which the protein is preferentially mono PEGylated at the N-terminal, non-natural methionine residue 
through amine or amide linkages. Bioactivity of the PEGylated protem was reduced approximately 5-fold 

30 relative to non-modified consensus interferon QCinstler et al., 1996). 

In one embodiment of the monoPEGylated G-CSF, the polyethylene glycol is attached to the region 
proximal to Helix A of G-CSF and the resulting monoPEGylated G-CSF has an EC50 less than about 1000 
pg/ml (approximately 50 pM), preferably less than about 100 pg/ml (approximately 5 pM), more preferably 
less than about 20 pg/ml (approximately 1 pM) and most preferably less than about about 15 pg/ml 

35 (approximately 0.7 pM). Alternatively, the polyethylene glycol moiety can be attached to the C-D loop of 
G-CSF and the resulting monoPEGylated G-CSF has an EC50 less than about 1000 pg/ml (approximately 50 
pM), preferably less than about 100 pg/ml (approximately 5 pM), more preferably less than about 20 pg/ml 
(approximately 1 pM) and most preferably less than about 15 pg/ml (approximately 0.7 pM). Alternatively, 
tide polyethylene glycol moiety can be attached to the region distal to Helix D of G-CSF and the resulting 



SUBSTITUTE SHEET (RULE 26) 



wo 01/87925 



PCT/USOl/16088 



14 

moiioPEGylated G-CSF has an EC50 less than about 1000 pg/ml (approximately 50 pM), preferably less 
than about 100 pg/ml (approximately 5 pM), more preferably less than about 20 pg/ml (approximately 1 
pM) and most preferably about IS pgAnl (approxinmtely 0.7 pM). Kinstler et al., (1996) described 
monoPEGylated wild type G-CSF in which the protem is preferentially monoPEGylated at the N-teiminal, 
5 non-natural methionine residue through amine or amide linkages. Bioactivity of the monoPEGylated G-CSF 
protein was reported to be reduced approximately 30% relative to non-modij5ed G-CSF, although ECsqs 
were not provided (Kinstler et al., 1996). Kmstler et al. (1996) did not determine whether modifymg other 
amino acids in the region proximal to helix A in G-CSF with PEG resulted in biologically active G-CSF 
protems. One purpose of the present invention is to disclose other amino acid positions in the region 

10 proximal to Helix A, and other regions, in G-CSF where PEG can be attached, resulting in biologically 
active, monoPEGylated G-CSF proteins. 

In one embodiment of the monoPEGylated GM-CSF, the polyethylene glycol is attached to the 
region proximal to Helix A of GM-CSF and the resulting monoPEGylated GM-CSF has an EC50 less than 
about 14000 pg/ml (approximately 1000 pM), preferably less than about 1400 pg/ml (approximately 100 

15 pM), more preferably less than about 280 pg/ml (approximately 20 pM) and most preferably less than about 
140 pg/ml (approximately 10 pM)). Alternatively, the polyethylene glycol moiety can be attached to the B- 
C loop of GM-CSF and the resulting monoPEGylated GM-CSF has an EC50 less than about 14000 pg/ml 
(approximately 1000 pM), preferably less than about 1400 pg/ml (approximately 100 pM), more preferably 
less than about 280 pg/ml. (approximately 20 pM) and most preferably less than about 140 pg/ml 

20 (^proxunately 10 pM)). Alternatively, the polyethylene glycol moiety can be attached to the C-D loop of 
GM-CSF and the resulting monoPEGylated GM-CSF has an ECso less than about 14000 pg/ml 
(q)proximately 1000 pM), preferably less than about 1400 pg/ml (approximately 100 pM), more preferably 
less than about 280 pg/ml (approxunately 20 pM) and most preferably less than about 140 pg/ml 
(approxinaately 10 pM). 

25 In one embodiment of the monoPEGylated GH, the polye&ylene glycol is attached to the region 

proximal to Helix A of GH and the resulting monoPEGylated GH has an EC50 less than about 2000 ng/ml 
(approximately 100 nM), preferably less than about 200 ng/ml (approximately 10 nM), more preferably less 
than about 20 ng/ml (approximately 1 nM) and most preferably less than about 2 ng/ml (approximately 0.1 
nM). 

30 The present invention further provides protein variants that can be covalently attached or 

conjugated to each other or to a chemical group to produce higher order raultimers, such as dimers, trimers 
and tetramers. Such higher order multimers can be produced according to methods known to those skilled 
m the art or as described in Examples 2 and 20. For example, such a conjugation can produce a GH, G- 
CSF, GM-CSF or alpha IFN adduct having a greater molecular weight than the corresponding native 

35 protein. Chemical groups suitable for coupling are preferably non-toxic and non-immunogenic. These 
chemical groups would include carbohydrates or polymers such as polyols. 

The **PEG moiety" useful for attaching to the cysteine variants ci the inresent invention to form 
*TEGyIated" proteins mclude any suitable polymer, for exanq)le, a linear or branched chained polyol. A 
preferred polyol is polyethylene glycol, which is a synthetic polymer composed of ethylene oxide units. The 
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ethylene oxide units can vary such that PEGylatedr^rotein variants can be obtained with apparent molecular 
weights by size-exclusion chromatography ranging from approximately 10,000 to greater than 500,000 kDa. 
The size of the PEG moiety durectly impacts its circulating half-life (Yamaoka et al, 1994). Accordingly, 
one could engineer protein variants with differing circulating half-lives for specific therapeutic applications 
5 or preferred dosing regimes by varying the size or structure of the PEG moiety. Thus, the present invention 
encon^asses GH protein variants having an apparent molecular weight greater than about 30 kDa, and more 
preferably greater than about 70 kDa as determined by size exclusion chromatography, with an EC50 less 
than about 400 ng/ml (18 nM), preferably less than 100 ng/ml (5 nM), more preferably less than about 10 
ng/ml (0.5 nM), and even more preferably less than about 2.2 ng/ml (0.1 nM). The present invention further 

10 encompasses G-CSF protein variants having an apparent molecular weight greater than about 30 kDa, and 
more preferably greater than about 70 kDa as determined by size exclusion chromatography, with an EC50 
less than about 100 ng/ml (5 nM), preferably less than 1000 pg/ml (50 pM), more preferably less than 100 
pg/ml (6 pM), and even more preferably less than about 15 pg/ml (0.7 pM). The present invention further 
encompasses alpha IFN (IFN-a) protein variants having an apparent molecular weight greater than about 30 

15 kDa, and more preferably greater than about 70 kDa as determined by size exclusion chromatography, with 
an IC50 less than about 1900 pg/ml (100 pM), preferably less than 400 pg/ml (21 pM), more preferably less 
than 100 pg/ml (5 pM), and even more preferably less than about 38 pg/ml (2 pM). The present mvention 
further encompasses GM-CSF protein variants having an apparent molecular weight greater than about 30 
kDa, and more preferably greater than about 70 kDa as determined by size exclusion chromatography, with 

20 an EC50 less than about 14,000 pg/ml (-1000 pM), preferably less than 1400 pg/ml (-100 pM), more 
preferably less than 280 pg/ml (20 pM), and even more preferably less than about 140 pg/ml (- 1 pM). 

The reactive PEG end group for cysteine modification includes but is not limited to vinylsulfone, 
maleimide and iodoacetyl moieties. The PEG ^d group should be specific for thiols witii the reaction 
occurring under conditions that are not detrimental to the protein. 

25 Antagonist hOH variants also can be prepared using a cysteine-added variant GH as described in 

PCT/US98/14497 and PCT/US/00/00931. Conditions that would benefit firom the administration of a GH 
antagonist include acromegaly, vascular eye diseases, diabetic nephropathy, restenosis following angioplasty 
and growth hormone responsive malignancies. 

As used herein, the term "derivative" refers to any variant of a protein expressed and recovered by 

30 the present methods. Such variants include, but are not limited to, PEGylated versions, dimers and other 
higher order variants, amino acid variants, truncated variants, fusion proteins, changes in carbohydrate, 
phosphorylation or other attached groups found on natural proteins, and any other variants disclosed herein. 

The compounds produced by the present methods can be used for a variety of in vitro and in vivo 
uses. The protems and their derivatives of the present invention can be used for research, diagnostic or 

35 therapeutic purposes that are known for their wildtype, natural, or previously known modified counterparts. 
In vitro uses include, for example, tiie use of the protein for screening, detecting and/or purifying other 
proteins. 

For therapeutic uses, one skilled m the art can readily determine the ^propriate dose, fiiequency of 
dosing and route of administration. Factors in making such determinations include, without limitation, the 
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nature of the protein to be administered, the condition to be treated, potential patient compliance, the age 
and weight of the patient, and the like. The compounds of the present uivention can also be used as delivery 
vehicles for enhancement of the circulating half-life of the ther^eutics that are attached or for directing 
delivery to a specific target withm the body. 
5 The following exanq)les are not intended to be limiting, but only exenq>lary of specific 

embodiments of the invention. 

Examples 

10 Example 1 

Refolding of the Growth Hormone Mutein T3C 

Methods for expressing, purifying and determining the in vitro and in vivo biological activity of 
recombinant human Growth Hormone (hGH) and hGH cysteine muteins are described in PCTAJS98/14497 

15 and PCT/US/00/00931. Methods for constructing cysteine muteins of hGH also are described in 
PCTAJS98/14497 and PCT/US/00/00931. One preferred method for expressing hGH in E. coli is to secrete 
the protein into the periplasm using the STU leader sequence. Secreted hGH is soluble and can be purified 
by column chromatography as described in PCT/USOO/00931. Certain cysteine muteins of hGH remain 
insoluble when secreted into the E. coli periplasm using the STII leader sequence. Procedures for refolding 

20 insoluble, secreted hGH proteins have not been described previously. The following protocols were 
developed to refold insoluble hGH cysteine muteins into a biologically active fona 

The insoluble GH T3C mutein (threonine at position 3 changed to cysteine; described in 
PCT/US98/14497 and PCT/US/00/00931) was expressed in E. coli as a protein secreted to the periplasmic 
space using the stn leader sequence as described in PCT/USOO/00931. The T3C protein was solubilized 

25 and refolded using the following two procedures, both of which use cysteine as a reducing agent and as a 
cysteine blocking agent to stabilize the fi:ee cysteine residue. Cultures (200 ml) of an coli stram 
expressmg tiie T3C mutein were grown and expression of T3C was induced as described in 
PCT/USOO/00931. The cells were lysed and the insoluble portion was isolated by centrifugation as 
described in Example 14. The msoluble material containmg T3C was dissolved m 20 noL of 8 M urea, 20 

30 mM cysteine, 20 mM Tris pH 9 and mbced by shaking for 1 hour at room temperatiue. The solub^ization 
mixture was next divided into two, with half bemg diluted into 50 mL of 10% glycerol, 20 mM Tris, pH 8 
and flie other half bemg diluted into 50 mL of 0.5% TWEEN 20, 20 mM Tris, pH 8. The refolds were held 
at 4**C for 24 hours before being clarified by centrifiigation and loaded onto a 5 mL Q-Sepharose Hi Trap 
column previously equilibrated in 20 mM Tris, 0,5% Tween 20, pH 7.6. Refolded, soluble T3C was eluted 

35 fi-om the column durmg a 20 column volume gradient of 0-300 mM NaCl in 0.5% Tween 20, 20 mM Tris 
pH 7.6. Recovered column fractions were analyzed by non-reducing SDS-PAGE. Monomeric T3C eluted 
at around 160 mM NaCl. Approximately 790 ^ig of monomeric T3C were recovered from the refold 
containing glycerol in the renaturation buffer. Apiwroxmiately 284 fig of monomeric T3C was recovered 
from the refold when Tween 20 was present in the renaturation buffer. The results indicate that soluble, 

40 monomeric T3C protein can be obtained using -either refold/renaturation procedure. Based on the greater 
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recovery yields of monomeric T3C protein, glycerol was used as a stabilizing agent in subsequent refold 
experiments. 

Example 2 

5 Comparison of reducing agents used to refold the Growth Hormone T3C Mutein 

Cultures (200 niL) of an E. coli strain expressing the T3C mutein were grown and T3C expressed 
as described in PCT/US00/0093L Insoluble T3C was isolated by lysing the cells with detergent/lysozyme 
treatment of the cells as described in Examples 5 and 14 . This material was suspended in 20 mL of 8 M 
urea, 20 mM Tris pH 9 and aliquoted mto 3 tubes. No reducing agent was added to the first tube ("Refold 

10 A"), 5 mM DTT was added to the second tube ('"Refold B") and 20 mM cysteine was added to the third tube 
(**Refold C"), After one hour of mixing at room temperature, the solubilizations were diluted into 30 mL of 
10% glycerol, 20 mM Tris, pH 8. The refolds were held at 4°C overnight. The next day, the refolds were 
clarified by centrifiigation and loaded onto 5 mL Q-Sepharose Hi Trap columns as described in 
PCT/USOO/00931. Recovered firactions were analyzed by non-reducmg SDS-PAGE. The T3C protein 

15 recovered fix)m **Refold A" (no reducmg agent) eluted as seyeral broad peaks fsom the Q-Sepharose 
column. By SDS-PAGE, the recovered protem product had some monomeric T3C protein present, but 
consisted mostly of aggregated T3C dimers (eluting at 210 mM NaCl) and T3C multimers (elutuig between 
300 mM to 1000 mM NaCl). Final recoveries of monomeric and dimeric T3C proteins are shown in Table 
1. the T3C protein recovered firom "Refold B" (with 5 mM DTT) eluted as a single broad peak firom the 

20 Q-Sepbaiose column, but was heterogeneous by non-reducing SDS-PAGE analysis. The monomeric T3C 
band was much broader than the pituitary hGH band and comprised a number of different molecular weight, 
monomeric species, which probably represent different disulfide isoforms of T3C. A small amount of 
dimeric T3C protein was also detected in several of the fiiactions. "Refold C" (with cysteine as the reducing 
agent) yielded mainly monomeric T3C protein, which appeared to be a single homogeneous species, as 

25 evidenced by the sharpness of the peak eluting firom the Q-Sepharose column at 160 mM NaCl and by the 
sharpness of the protein band at the correct molecular weight (relative to the standard pituitary hGH) when 
analyzed by non-reducing SDS PAGE. Final recoveries of monomeric and dimeric forms of T3C firom 
each of the refolds are given in Table 1. The data indicate that solubilizing/refolding the T3C protein in the 
presence of cysteine results in greater yields of soluble monomeric T3C protein than does 

30 solubilizing/refolding the protein in the absence of a reducing agent or in the presence of DTT. The results 
also indicate that solubilizing/refolding the T3C protein in the presence of cysteine yields a more stable, 
homogeneous preparation of soluble, monomeric T3C protein than does solubilizing/refolding the protein in 
the absence of a reducing agent or in the presence of DTT. 
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Table 1 

Recoveries of T3C Proteins Prepared Using Various Refold Procedures 



Refold 


Reducing Agent 


Monomeric T3C protein 
Yield (ng)*^ 


Dimeric T3C protein 
Yield (ug)"^ 


A 


none 


30 


120 


B 


SmMDTT 


370 


25 


C 


20 naM Cysteine 


534 


225 



" Protem recov^ed per 66 ml of E. coli culture 



5 

The monomeric T3C protein recovered from the Refold B, which contained DTT in the 
solubilization mixture, can be converted to stable, disulfide-linked homodimeric T3C protein by placing the 
protein under conditions that allow for disulfide bond formation. These include conditions where an 
oxidizing agent is added to the protein, or by the addition of a second disulfide-linked reagent that is 

10 capable of undergoing a disulfide rearrangements when the pH is near neutral or alkali. Examples of 
oxidizing agents that could be used include sodium tetrathionate or oxygen. Optionally, trace amounts of 
divalent metal ionis such as copper or cobalt can be added to catalyze the reaction. Usefiil disulfide-linked 
reagents mclude cystine, cystamine, oxidized glutathione, dithioglycolate, or other low molecular weight 
dithiols. Alternatively, monomeric T3C protein can be held at an acidic pH to prevent aggregation and 

IS unwanted disulfide rearrangements. 

The soluble, refolded GH cysteine muteins prepared according to the procedures described in 
Examples 1 and 2 can be purified by various chromatography procedures known to those of skill in the art. 
These chromatographic procedures include ion exchange, size exclusion, hydrophobic interaction (HIC), 
metal chelation affinity chromatographies (IMAC), Size Exclusion Chromatography (SEC) , Reversed Phase 

20 chromatography or a combination of these techniques. As one example, the GH muteins can be captured 
from the soluble fraction of the refold mixture using a Q-Sepharose fast flow resin (Pharmacia) equilibrated 
in 20 mM Tris-HCl, pH 8.0. The column can be washed with 20 mM Tris-HCl, pH 8.0 and bound proteins 
eluted with a linear 10-20 volume increasing salt gradient from 0 to 250 mM NaCl in 20 mM Tris-HCl, pH 
8.0. Optionally, Glycerol (10% final concentration) -can be added to the column buffers. Fractions 

25 contaming the hGH muteins can be identified by SDS-PAGE and West^ blotting. Alternative resms that 
can be used to capture hGH muteins Scorn die soluble fraction of the refbld/renaturation mixture include 
HIC, other ion exchange resins or afiinity resins. 

The cysteme muteins can be purified fiirther by hydrophobic mteraction chromatography. Q- 
Sepharose column fractions containing the GH muteins can be pooled and NaCl added to a final 

30 concentration of 2 M. The pool can be loaded onto a Butyl- Seph£ux>se fest flow resin previously 
equiUbrated in 2 M NaCl» 20 mM sodium phosphate, pH 7.5. GH muteins can be eluted from the resin 
using a reverse salt gradient from 2 M to 0 M NaCl in 20 mM pho^hate, pH 7.5. Fractions containing the 
GH muteins can be identified by SDS-PAGE and Western blotting, and pooled. Alternatively the Q- 
sepharose fractions -containing the GH muteines can be pooled and ammonium sul&te added to a final 

35 concentration of 2 M before being loaded cmto a I^enyl -Sepharose column. The GH muteins can be 
eluted from the lesin using a reverse salt gradient from 2 M to 0 M ammonium sulfate in 20 mM sodium 
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phosphate, pH 7.S. Fractions containing the GH muteins can be identified by SDS-PAGE and Western 
blotting, and pooled. 

If further purification is desired, the HIC pool containing the GH muteins can be loaded directly 
onto a nickel chelating resin (Qiagen) equilibrated in 10 mM sodium phosphate, 6.5 M NaCl, pH 7.5. 
5 Following a wash step, the GH muteins can be recovered using a 0 - 30 mM imidizole gradient in 10 mM 
sodium phosphate, 0.5 M NaCl, pH 7.5. GH has a high afiSnity for nickel, presumably through the divalent 
metal-binding site formed by HI 8, H21 and El 74. As a result, GH can be obtained in highly piu*e form 
using a metal chelation column (Maisano et aL, 1989). The GH muteins will bind tightly to the nickel 
column and elute at similar imidazole concentrations (around 15 mM) as wild-type GH. Alternatively a 

10 copper chelating column may be used in place of a nickel chelating column. 

Biological activities of the purified GH cysteme muteins can be measured using the cell 
proliferation assay described in PCT/USOO/0093L Protein concentrations can be determined using a 
Bradford dye binding assay (Bio-Rad Laboratories). 

The T3C mutein was purified as follows. A 600 mL culture ofE. coli was grown and T3C protein 

15 expression induced as described above. Insoluble T3C was isolated by treating the cells with a 
detergent/lysozyme mixture (B-Per^", Pierce) as described in Examples 5 and 14. The insoluble material 
was suspended in 40 mL of 8 M urea, 20 mM Tris, 20mM Cysteine, pH 9. After one hour of mixing at 
room temperature, the solubilization mixture; was diluted into 200 mL of 15% glycerol, 20 mM Tris, pH 8, 
40 fiM copper sulfate. The refold was held at 4*'C overnight. The next day, the refold was clarified by 

20 centrifugation and loaded onto a 5 mL Q-Sepharose Hi Trap column equilibrated in 10% glycerol, 20 mM 
Tris, pH 8. T3C was recovered by elution with a 20 column volume gradient from 0-250 mM NaCl in 20 
mM Tris, pH 8, 10% glycerol. Recovered fractions were analyzed by non-reducing SDS-PAGE. Fractions 
containing predominantly T3C protein of the coirect apparent molecular weight were pooled. Pooled 
fractions yielded 4.6 mg of purified T3C protein. This material was uised for the PEGylation studies 

25 described in Example 3. Biological activity of the purified T3C protein was measured in the GH-R4 cell 
proliferation assay described in Examples 1 and 2 and PCT/US98/14497 and PCT/USOO/00931. The T3C 
protem stimulated proliferation of the GH-R4 cells with an EC50 of 1 .35 ng/ml. 

Other cysteine muteins of GH that were prepared by this procedure include *-lC, P2C, P5C, 
K38C, Q40C, K41C, S55C, S57C, T60C, Q69C, N72C, N99C, LIOIC, V102C, Y103C, D130C, S132C, 

30 P133C, R134C, T135C, Q137C, K140C, Q141C, T142C, Y143C, K145C, D147C, N149C, S150C, H151C, 
N152C, D153C, E186C, and G187C. Biological activities of certain of the purified GH cysteine muteins 
were measured in the GH-R4 cell proliferation assay described in PCT/USOO/00931, The observed EC50S 
for muteins *-lC, P2C, P5C, K38C, Q40C, S55C, N99C, LIOIC, V102C, Y103C, P133C, Q137C, Ki40C, 
Y143C, D147C, N149C, E186C, and G187C ranged from 0.7 ng / ml to 2.2 ng/ml . These values are all 

35 nearly equivalent to the observed EC50S for wild type GH controls in these assays which ranged from 0.3 ng 
/ml to 1,5 ng/ml . 
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Example 3 

General Methods for PEGylatlon and Purifying PEGylated Forms of Proteins Containing Free 

Cysteine Residues 

Proteins containing &ee cysteine residues can be PEGylated using a variety of cysteine-reactive 
5 PEG-maleimide (or PEG-vinylsulfone) reagents that are commercially available. The recombinant proteins 
are generally partially reduced with dithiothreitol (DTT), Tris (2-carboxyethyl) phosphine-HCl (TCEP) or 
some other reducing agent in order to achieve optimal PEGylation of the free cysteine. The free cysteine is 
relatively unreactive to cysteine-reactive PEGs unless this partial reduction step is performed. The amount 
of reducing agent required to partially reduce each mutein can be determined empirically, using a range of 

10 reducing agent concentrations at different pHs and temperatures. Reducing agent concentrations typically 
vary from 0.5 equal molar to 10-fold molar excess. Preferred temperatures are 4°C to 3TC. The pH can 
range from 6.5 to 9.0 but is preferrably 7.5 to 8.5. The optimum conditions will also vary depending on the 
reductant and time of exposure. Under the proper conditions, the least stable disulfides {typically 
intermolecular disulfides and mixed disulfides) are disrupted first rather than the more thermodynamically 

15 stable native disulfides. Typically, a 5-10 fold molar excess of DTT for 30 minutes at room temperature is 
effective. Partial reduction can be detected by a slight shift in the elution profile of the protein from a 
reversed-phase column. Partial reduction also can be detected by a slight shift in apparent molecular weight 
by non-reducing SDS-PAGE analysis of the protein sample. Care must be taken not to "over-reduce" the 
protein and expose additional cysteine residues. Over-reduction can be detected by reversed phase-HPLC 

20 (the over-reduced protein will have a retention time similar to the folly reduced and denatured protein) and 
by the appearance of protein molecules containing two PEGs following the PEGylation reaction (detectable 
by an apparent molecular weight change on SDS-PAGE). In the case of cysteine muteins, the corresponding 
wild type proteui can serve as a control since it should not PEGylate under conditions that do not reduce die 
native mtramolecular disulfides. Excess reducing agent can be removed prior to PEGylation by size 

25 exclusion chromatography or by dialysis. TCEP need not be removed before addition of the PEGylation 
reagent as it is does not contain a free thiol group. The partially reduced protein can be reacted witii various 
concentrations of PEG-maleimide or PEG-vinylsulfone (typically PEG: protem molar ratios of 1:1, 5:1,10:1 
and 50:1) to determine the optimum ratio of the two reagents, PEGylation of the protein can be monitored 
by a molecular weight shift for example, using SDS-PAGE. The lowest amount of PEG that gives 

30 significant quantities of mono-pegylated product without giving di-pegylated product is typically considered 
desirable. In some instances, certain additives can enhance the PEGylation jdeld. These additives include, 
but are not limited to, EDTA, borate, chaotropes (urea, guanidine, organic solvents), detergents, osmolytic 
stabilizers (polyols, sugars, polymers, amino acids and derivatives thereof), and other ionic compounds 
(citrate, sulfates, phosphates, quaternary amines, chlorides nitrates, thiocyanates, etc.) Usefiil 

35 concentrations of EDTA are 0.01 - 10 mM, with 0.5 - 1 raM being preferred concentrations. Generally, 
mono-PEGylated protein can be purified from non-PEGylated protein and unreacted PEG by size-exclusion, 
ion exchange, affinity, reversed phase, or hydrophobic interaction chromatography. Fractions enriched for 
the mono-PEGylated protein ( a single PEG molecule attached to the cysteine mutein) can be identified by 
SDS-PAGE and/or Western -blotting. These fractions can be pooled and stored frozen. The presence of die 
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PEG moiety generally alters the protein's aflSnity for the resin, allowing the PEGylated protein to be 
separated from tiie non-PEGylated protein. Other purification protocols such as 2-phase organic extraction 
or salt precipitation also can be used. The purified, PEGylated protein can be tested in the cell 
proliferation/inhibition assays described in the various Examples described herein and in PCT/US98/14497 
5 and PCT/lJSOO/00931 to determine its specific activity. In vivo efficacy of the PEGylated proteins can be 
determined as described in the Exan^les provided herein and m PCT/US98/14497 and PCT/USOO/00931. 
Experiments can be performed to confum that the PEG molecule is attached to the protein at the proper site. 
This can be accomplished by chemical or proteolytic digestion of the protein, purification of the PEGylated 
peptide (which will have a large molecular weight) by size exclusion, ion exchange or reversed phase 
10 chromatography, followed by amino acid sequencing. The PEG-coupled amino acid will appear as a blank 
in the amino acid sequencing run. 

The following conditions were used to PEGylate the GH mutein T3C and to purify the PEGylated 
T3C protein. Initial PEGylation reactions conditions were determined using aliquots of the refolded T3C 
protein prepared as described in Example 2 (using cysteine as the reducing agent and as the cysteine 
15 blocking agent to solubilize and refold the protein), TCEP [Tris (2-carboxyethyl) phosphine]-HCl as the 
reducing agent and 5kDa cysteine reactive PEGs firom Shearwater Polymers (Huntsville, Alabama). Two ^g 
aliquots of purified T3C were incubated with increasing concentrations of TCEP at room temperature in 100 
mM Tris, pH 8.5 in the presence of varymg amounts of excess 5 kDa maleimide-PEO or 5 kDa 
vinylsulfone-PEG. After 120 minutes, aliquots of the reactions were unmediately analyzed by non- 
20 reducing SDS-PAGE. At pH 8.5, a 5-fold molar excess of TCEP and 1 5-fold excess molar of either 5 kDa 
maleimide or 5 kDa vinyl sulfone PEG yielded significant amounts of monoPEGylated T3C protein after 
two hours without detectable di or tri-PEGylated protein. The T3C mutein needed to be partially reduced 
by treatment with a reductant such as TCEP in order to be PEGylated. Wild type GH did not PEGylate 
under identicd partial reducing conditions, indicating that the PEG moiety is attached to the cysteine residue 
25 introduced into the mutein. These conditions were used to scale the PEGylation reaction for purification 
and evaluation of biological activity. A larger PEGylation reaction (300 \xg) was performed for 2 hr at room 
temperature, using a 5-fold excess of TCEP and 15-fold of 10 kDa maleimide PEG. At the end of the 
reaction time, the PEGylation mbcture was diluted 2X with ice cold 20 mM Tris, 15% glycerol, pH 8.0 and 
immediately loaded onto a Q-Sepharose column (1 mL, HiTrap). PEGylated T3C was eluted firom the 
30 column by running a 20 mL gradient from 0-0.2 M NaCl in 20 mM Tris, 15% glycerol, pH 8. The presence 
of the PEG moiety decreases the protein's affinity for the resin, allowing the PEGylated protein to be 
separated from the non-PEGylated protein. Fractions enriched for morio-PEGylated T3C (a single PEG 
molecule attached to the T3C monomer) were identified by SDS-PAGE, pooled and frozen. The mono- 
PEGylated T3C protein eluted at approximately 80 mM NaCl and its apparent molecular weight by SDS- 
35 PAGE was approximately 30 kDa. 

lOK PEG-T3C, 20K PEG-T3C, and 40 K PEG-T3C were also prepared by the method described 
above. Bioactivity of the purified PEG-T3C proteins were measured in the cell proliferation assay 
described in Examples 1 and 2 and PCT/US98/14497 and PCT/US/00A)0931 to determine its qjecific 
activity. TbQ PEG-T3C proteins stimulated proiif«ation of GH-R4 cells similar to wild type GH and non- 
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PEGylated T3C protein. The EC50 for the 5K PEG-T3C protein was 1.2 ng/ml, the ECso for the lOK PEG- 
T3C was 1.2 ng/ml, and the EC50 for the 20K PEG-T3C was 3-4 ng/ml. The EC50 for the 40K- PEG-T3C 
can be determined using the cell proliferation assay described in Examples 1 and 2 and PCT/US98/14497 
and PCT/US/00/00931. In vivo efficacy of PEG-T3C and other PEGylated GH cysteine muteins can be 
5 determined as described in PCT/US98/14497 and PCT/USOO/00931 and Exan5)le 4. 

Other cysteine mutants of GH that were PEGylated and purified according to the procedures 
oudined above include P2C, P5C, S132C, P133C, and R134C. The biological activities of these muteins 
that were modified with 20 kDa- PEG moieties were measured using the cell proliferation assay described in 
Examples 1 and 2 and PCT/US98/14497 and PCT/US/00/00931. The observed EC50S for these PEGylated 
10 muteins muteins ranged from 1.7 ng / ml to 6.0 ng / ml. These values are all similar to, but slightly greater 
than, the observed EC50S for wild type GH control assays that were performed in parallel. The EC50S for 
these wild type GH controls ranged from 0.6 ng / ml to 1 .2 ng / ml . 

Example 4 

15 PE&-T3C Growth Hormone Stimulates Somatic Growth in Growth Hormone-Deficient Rats 

A The ability of PEG-T3C to stimulate somatic growth was determined in hypophysectomized 
(HYPOX) rats, which are unable to synthesize growth hormone due to removal of their pituitaries. HYPOX 
male Sprague-Dawley rats were purchased from a commercial vendor and weighed about 90 g. The rats 
were acclimated for 13 days. Animals gaining more than 4 g during accliniation were culled from the study. 

20 Body weight measurements were taken at the same time every day (9:30 AM). Rats were randomized by 
weight to the various test groiQ)s. There were 5 rats per group except for the group receiving every day 
doses of 20 kDa-PEG-T3C, in ^ch there were only four rats. Rats were weighed daily and were given 
daily or every other day subcutaneous injections of placebo (Phosphate Buffered Saline (PBS) containing 
200 ^g/ml rat serum albumin (Sigma Chemical Company)), a commercial recombinant human growth 

25 hormone, Nutropin® , or various doses of 20 kDa-PEG-T3C prepared as described in Example 3. All 
protein solutions were prepared in PBS contaming 200 ^g/ml rat serum albumin. Animals were treated for 
9 consecutive days. On day 10, the animals were sacrificed and their tibias were harvested. The tibias were 
fixed in 10% neutral buffered formalin. The fixed tibias were decalcified in 5% formic acid and split at the 
proximal end in the fix)ntal plane. The tibias were processed for parafGn embedding and sectioned at 8 

30 microns and stained with toluidine blue. The width of the tibial physis was measured on the left tibia (5 
measurements per tibia). Cumulative body weight gain and tibial epiphyses measurements for the different 
test groups are shown in Table2. The results show that 20 kDa-PEG-T3C stimulates body weight gain and 
bone growth in growth hormone deficient rats. 



SUBSTITUTE SHEET (RULE 26) 



wo 01^7925 



PCTAJSOl/16088 



23 



Table! 

Effects of every day or every other day administration of placebo, Nutropin or 20 kDA-PEG-T3C on 
body weight gain and tibial epiphyses width in hypophysectomized rats 



Con:q>ound 


Dose 


Injection 
Frequency 


Cumulative Body 
Weight Gain 
(grams) 


Tibial Epiphyses Width 
(mean+/-SE) 

(m) 


Placebo 




Every day 


-1.0+/- 0.707 


206.8+/- 9.2 


Nutropin 


10 pig/injection 


Every day 


11.2+/. 0.97'^ 


348.8+/- 8.6" 


20kDa.PEG-T3C 


10 iig/injection 


Every day 


14.3+/- 0.75° 


333.0+/- 9.8'' 


Placebo 




Every other day 


0.6+/- 1.03 


204.4+/. 8.6 


Nutropin 


10 ^ig/injection 


Every other day 


8.6+/- 1.12^ 


298.8+/- 10.1 " 


20 kDa-PEG-T3C 


10 |ig/injection 


Every other day 


15.4+/- 0.68 b°'' 


357.2+/. 7.7' 


20kDa.PEG-T3C 


2 M,g/injection 


Every other day 


5.6+/- 0.51^ 


274.8+/- 9.0' 


20kDa-PEG-T3C 


0.4 jig/injection 


Every other day 


-0.2+/- 0.66 


225.2+/- 10.0' 



5 ° p< 0.05 versus every day placebo using a two-tailed T test 



p< 0.05 versus every other day placebo using a two-tailed T test 
p< 0.05 versus every other day Nutropin using a two-tailed T test 

B. A second experiment was performed as described for Example 4 A. except that the test 
10 compounds were administered by subcutaneous injection every day or every third day. In addition, one 
dose of T3C modified with a 40 kDa-PEG was tested. HYPOX male Sprague-Dawley rats were purchased 
froma commercial vendor and weighed about 100 g. Body weight measurements were taken at the same 
time every day. Rats were randomized by weight to the various test groups. There were 5 rats per group 
except for the group, except for tiie test group receivmg 40 kDa-PEG-T3C. Rats were weighed daily and 
15 were given daily or every diird day subcutaneous injections of placebo (Phosphate Buffered Saline (PBS) 
contaming 200 ^g/ml rat serum albumin (Sigma Chemical Company)), a commercial recombinant human 
growth hormone, Nutropin®, various doses of 20 kDa-PEG-T3C or 40 kDa-PEG-T3C. The PEG-T3C 
proteins were prepared as described in Example 3. All protein solutions were prepared in PBS contaming 
200 fig/ml rat serum albumin. Animals were treated for 9 consecutive days. On day 10, the animals were 
20 sacrificed and their tibias were harvested and prepared for sectioning as described in Example 4^.. 
Cumulative body weight gain and tibia epiphyses widths for the diffimnt test groiq)S are shown in Table 3. 
The results show that 20 kDa-PEG-T3C and 40 kDa-PBG-T3C stimulate body weight gain and bone growth 
in growth hormone. deficient rats. 
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Tables 

Effects of every day or every third day administration of placebo, Nutropin, 20 kDA-PEG-T3C or 40 
kDa-P£G-T3C on body weight gain and tibial epiphyses width in hypophysectomized rats 



Compound 


Dose 


Injection 
Frequency 


Cumulative Body 
Weight Gain 
(grams) 


Tibial Epiphyses Width 

(mean+/-SE) 

(Hm) 


Placebo 




Every Day 


0.8 +A 0.685 


223+/- 15.1 


Nutropin 


30 lig/injection 


Every day 


21.3 +A 1.432 


408.4+/- 14.2 


Nutropin 


10 )ig/injection 


Every Day 


16.2 +A 1.232 


399.6+/- 15.6 


20kDa-PEG-T3C 


10 ^g/injection 


Every Day 


18.6 +A 2.215 


384.4+/- 13.0 


Placebo 




Every third day 


1.5 +/- 1.370 


231.6+/- 17.4 


Nutropin 


30 )ig/injection 


Every third day 


6.8+/. 1.385 


315.2+/- 15.6 


Nutropin 


10 lig/injection 


Every third day 


8.0+/- 1.614 


284.0+/. 6.9 


20 kDa-PEG-T3C 


30 |.ig/injection 


Every third day 


17.5+/- 1.162 


428.4+/- 18.3 


20 kDa-PEG-T3C 


10 |ig/injection 


Every third day 


12.3 +/- 0.792 


329.2+/. 15.6 


20 kDa-PEG-T3C 


2 jig/injection 


Every third day 


8.0+/- 1.379 


263.2+/. 7.1 


40kDa-PEG-T3C 


10 |ig/injection 


Every third day 


17.2+/- 0.868 


360.5+/. 21.9 



Example 5 

Refolding and Purification of IFN-a2 Cysteine Muteins 

Methods for expressing, purifying and determining the in vitro and in vivo biological activity of 

10 recombinant human alpha interferon 2 (IFN-ot2) and IFN-cc2 cysteine muteins are described in 
PCT/USOO/00931. Methods for constructing cysteine muteins of IFN-a2 and preferred sites within the 
IFN-(x2 protein for the locations of added cysteine residues also are described in PCT/US98/14497 and 
PCT/USOO/00931. The following muteins have been constructed in E coli usmg those methods: CIS, Q5C, 
43C44, N45C, Q46C, F47C, Q48C, A50C, D77C, C98S, QIOIC, T106C, E107C, T108C, S163C, E165C, 

15 ♦166C, D2C, L3C, T6C, S8C, T52C, G102C, V103C, G104C, VIOSC, P109C, LUOC. MlUC, S160C, 
L161C, R162C and K164C. One preferred method for e3q}ressing ]FN-a2 in E. coli is to secrete the protein 
mto the periplasm using the STII leader sequence. A fraction of the secreted IFN-a2 is soluble and can be 
purified by column chromatography as described in PCT/USOO/00931. Certain cysteine muteins of IFN-a2 
remain insoluble when secreted into the E. coli periplasm using the STU leader sequence. SDS-PAGE 

20 analysis of the osmotic shock supematants of the muteins showed most to have reduced tas compared to 
wild type) levels of the 19 kDa rIFN-a2 band. SDS-PAGE analyses of whole cell lysates and the msoluble 
material from the osmotic shocks revealed that these muteins were expressed at relatively high levels but 
accumulated primarily in an insoluble form, presumably in the periplasm. These proteins comigrated with 
wild type rIFN-a2 standards under reducing conditions indicatmg that the STII leader had been removed. 

25 Qualitative assessments of relative expression levels of the muteins are summarized in Table 4. Procedures 
for refolding insoluble, secreted IFN-a2 proteins have not been described previously. The following 
protocol (here referred to as 'Trotocol I") was developed to express and refold IFN-a2 cysteine muteins 
into a biologically active form. 
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For expression of IFN-a2 cysteine muteins and IFN-a2, typically, a 325 ml culture in a 2 liter 
shake flask, or a 500 ml culture in a 2 liter bafQed shake flask, were grown at 37°C in a gyrotory shaker 
water bath at '-1 70-220 rpm. Cultures were grown, induced, harvested, and subjected to osmotic shock as 
described in PCT/USOO/00931. Resulting supematants and pellets were processed immediately or stored at 
5 -80°C. 

IFN-a2 cysteine muteins that were recovered as insoluble proteins in the osmotic shock pellets 
were denatured, reduced and refolded into their proper conformations usmg the foUowmg refold procedure. 
The pellet fiom the osmotic shock lysate was first treated with B*P£R ™ bacterial protein extraction 
reagent as described by the manufacturer (Pierce). B-PER is a mild detergent mixture that disrupts the E. 

10 coli membranes and releases the cytoplasmic contents of the cells. Insoluble material was recovered by 
centrifugation, resuspended in water, and recentrifiiged. The resulting pellet was solubilized in 5 mL of 6 M 
guanidine, 50 mM cysteine in 20 mM Tris Base. The mixture was allowed to stir for 30 minutes before 
being dialyzed overnight at 4°C against 400 mL of 40mM sodium phosphate, 150 mM NaCl, pH 8.0. The 
next day the pH of the refold mixture was adjusted to 3.0 and the mixture was centrifuged before being 

15 loaded onto an S-Sepharose column, followed by a Cu*^ MAC column as described for the purification of 
rIFN-a2 from the osmotic shock supematant in PCT/USOO/00931. Six IFN-a2 cysteme muteins: Q5C, 
C98S, QIOIC, T106C, E107C and *166C have been refolded and purified using these procedures. Similar 
procedures can be used to refold and purify insoluble wild type IFN-a2. 

Non-reducing SDS-PAGE analysis of purified Q5C, C98S, QIOIC, T106C, E107C, and *166C 

20 cysteine muteins showed that the muteins were recovered predominantly as monomers, migrating at the 
expected molecular weight of ~ 19 kDa. C98S migrated with a slightly higher molecular weight than the 
other rINF-a2 muteins due to the absence of the native Cysl-Cys-98 disulfide bond. Some of the purified 
muteins contained small amounts of disulfide-linked rIFN-a2 dimers. The molecular weights of the dimer 
species were approximately 37-38 kDa. 

25 When processing a number of cyteme muteins of IFN-<x2, it was discovered that certain cysteine 

muteins q)peared to be present in both the soluble and insoluble fractions following cell lysis. Ratios of 
soluble verus insoluble IFN-a2 protein varied from mutant to mutant. Therefore, an alternative 
solubilzation/refolding procedure (here referred to as '^Protocol n") that involves a whole cell solublization 
step was developed to enhance recovery of the IFN-a cysteine muteins. A modification of the culture 

30 methods was found to unprove the efficiency of processing of the STII leader sequence and was eiiq)loyed 
to express IFN-a cysteme muteins for refolding and purification, as detailed below. In the modified 
method, 325 - 400 ml cultures were grown in LB media containing 100 mM MES, pH 5.0 and 100 ng/ml 
an^icillin at 37'^C with vigorous shaking, e.g., 220-250 rpm m a New Brunswick C25KC environmental 
shaker, to a cell density of 0.5 - 0.7 OD at 600 nm. Cultures were then induced by addition of IPTG 

35 (isopropyl-P-D-thiogalactopyranoside) to a final <:oncentation of 0.5 mM and upon induction the 
temperature was reduced to 28^C and the shaker speed was reduced io 140 rpm. Induced cultures were 
incubated ovemigjht (14-18 hours) and harvested by centeifiigation. Cell pellets were processed inmediately 
or stored at -20^C or -80^C until processing. The cell pellets derived from a 325-400 mL induced culture 
are first suspended in 10 mL of 8 M Guanidine, 20 mM Cysteine, 20 mM Mes, 2% Tween 20, pH 3 and 
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mixed until a homogeneous suspension is present. The pH is then increased to between pH 8-9 and the 
solubilization mixture is stirred for 3 hours. The cell lysate is next diluted 1:20 with ice cold renaturation 
buffer (20 mM Tris, pH 0.3 M guanidine, 1 M urea, 40 pm copper sulfate, pH 8). The cloudy suspension 
is allowed io sit 1-2 days at 4^C. The refold is clarified by centrifiigation followd by a pH adjustment to 3 
5 and second round of centrifiigation. The supernatant is diluted 1 :4 with cold water and load onto a S mL S- 
Seph Hi Trap. The ion exchange column is eluted with a 100 mL gradient of 0-70% Buffer B, with BuflFer 
A being 20 mM Mes, pH 5 and Buffer B being 10% E&ylene glycol 500 mM NaCl, 20 mM Mes pH 5. 
Alternatively, refolded BFN-a cysteine mutems can be captured fi:om the refold mixture using a HIC 
column, such as a Phenyl-Sepharose column. The refold mixture is first centrifiiged, ammonium sulfate is 

10 added to the supernatant to a final concentration of 10%, the mixture is recentrifuged, and the supernatant 
loaded onto a 10 mL Phenyl Sepharose column equilibrated in 10% ammonium sulfate, 20 mM Tris, pH8. 
IFN-a cysteme muteins are eluted from the column using a 100 mL linear gradient from 10% ammonium 
sulfate, 20 mM Tris pH 8 to 30% ethylene glycol, 20 mM Tris, pH 8. The interferon pool from a Phenyl- 
Sepharose column can be further purified using a copper chelating column, S-Sepharose column or both. 

15 Interferon cysteine muteins also can be solubilized and refolded using other reducing agents that 

also act as cysteine blocking agents. Substitution of reduced glutathione, thioglycolic acid or cysteamine for 
cysteine in the solubilization/refold mixtures yielded refolded, soluble IFN cysteine variants that could be 
purified and PEGylated following the procedures described m Example 7. When no reducuig agent or 20 
mM DTT was substituted for cysteme in the solubilization/refold mixtures, yields of refolded, soluble IFN 

20 cysteine mutems were reduced to non-detectable levels when the refold mixture was analyzed by Reversed 
Phase HPLC. Additionally, no refolded, soluble IFN cysteine mutein was recovered following S-Sepharose 
chromatography of the refold mixture when no reducing agent or 20 mM DTT was substituted for cysteine 
in the solubilization/refold mixtures. 

The following muteins were esqiressed m E coli, refolded and purified using Protocol 11: CIS, 

25 Q5C, 43C44, N45C, F47C, Q48C, A50C, C98S, QIOIC, T106C, E107C, S163C, B165C, *166C, D2C, 
L3C, T6C, S8C, T52C, G102C, V103C, G104C, Vi05C, P109C, L110C,M111C, S160C, L161C, R162C 
and K164C. These refolds were performed at pH 8 or in some instances 7.5. 

Example 6 

30 BioactivitiesofIFN-a2 Cysteine Muteins 

Biological activities of the purified Q5C, C98S, QIOIC, T106C, E107C, and *166C IFN-a2 
cysteine muteins that were purified using Protocol I of Example 5 were measured in the Daudi growth 
inhibition assay described in PCT/USOO/00931. Protein concentrations were determined using Bradford or 
BCA protem assay kits (Bio-Rad Laboratories and Pierce). Commercial wild type rIFN-a2 and rIFN-a2 

35 prepared as described in PCT/USOO/00931 were analyzed in parallel on the same days to control for 
interday variability in the assays. The mutems inhibited proliferation of Daudi cells to the same extent as 
the wild type rIFN-a2 control proteins, witiiin the error of the assay. Mean ICsoS for five of the muteins 
(Q5C, QIOIC, T106C, E107C and *166C) were similar to tiie mean IC50 s of the wild type rlFN-a proteins, 
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ranging from 15-18 pg/ml. The mean IC50 for the C98S protein was 28 pg/ml. These data are summarized 
in Table 4. 



Table 4. 

5 Expression and in vitro Bioactivities of IFN-a2 Cysteine Muteins 



IFN-02 
Protein 


Mutation 
Location 


Relative Expression 


Form 
Assayed 


Mean IC50 
(pg/ml) 


ICsoRange^ 
(pg/ml) 


Total 
Cellular ^ 


Percent 
Soluble^ 


rIFN-a2 ■* 










16 +/- 7 


8-29 {n==10) 


rIFN-a2* 




-H-++ 


-33 


Soluble 


13 +/-4 


7-19 (ir=10) 


CIS 


N-tennmal region 


+/- 


0 








Q5C 


N-terminal region 


-H-H- 


-20 


Refolded 


17 


15, 17, 20 


43C44 


A-B loop 


++ 


0 








N45C 


A-B loop 


++ 


0 








Q46C 


A-B loop 


+/- 


0 








F47C 


A-B loop 


++++ 


~5 








Q48C 


A-B loop 


+/. 


0 








A50C 


A-B loop 


+/- 


0 








D77C 


B-C loop 


+A 


0 








C98S 


C-helix^ 


1 1 1 1 I 


-5-10 


Refolded 


28 


22,30.32 


QIOIC 


C-D loop 


1 M 1 1 


-5-10 


Refolded 


18 


10,22,23 


T106C 


C-D loop 


1 1 1 M 


-5-10 


Refolded 


18 


18,18 


E107C 


C-D loop 


Mill 


-5-10 


Refolded 


18 


8,22.24 


T108C 


C-D loop 


+/. 


0 








S163C 


C-terrainal region 


++++ 


--33 








E165C 


C-terminal region 


+++ 


-20 








*166C 


C-terminus 


+++ 


-20 


Refolded 


15 


8,16,20 



* Relative accumulation of the IFN-a2 protein in whole cell extracts 

^ Portion of the IFN-a2 protein in the osmotic shock supernatant, determined from 



SDS-PAGEgels 

10 ^ IC50 values from individual experiments. A range is shown when N > 5. 
^ Commercial wild type rIFN-a2 (Endogen, Inc.) 
^ Wild type rIFN-a2 prepared by Bolder BioTechnology, Inc. 
^ Mutation creates a free cysteine (C98) in the C-helix 
^ Mutation creates a free cysteine (CI) in the N-terminai region 

15 

Biological activities of the following muteins, purified using Protocol II of Example 5, were 
measured in the Daudi growth inhibition assay described in PCT/USOO/00931: CIS, D2C, L3C, S8C, 
N45C, F47C, C98S, V103C, V105C, E107C, MlllC, R162C, S163C, K164C, E165C and *166C The 
observed ICsoS are listed in Table 5 along with IC50S for wild type rIFN-a protein controls used m the same 
20 experiments. 
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Table 5. 

In vitro Bioactivities of IFN-a2 Cysteine Muteins Purified by Protocol U yfith and without 

PEGylation 



IFNa2 
Mutant 


Mutation location 


IC50' (pg/ml) 


IC50 (pg/ml), 20K PEG-Protein ' 


RiFN-a2 




ID to 




KirXN-az 




1 A f n 1 no 
10 lO luy 






XT 1 ' 

N-tenninal region 








N-tenninal region 




inn 




N-terminal region 


OA 1^ 


ins 'in(\ 


coo 


N-terminal region 


0 1 






J\rO loop 






F47C 


A-B loop 


66, 56, 58 


120,72,240 


C98S' 


C-helix 


105, 110,100 


500, 720,900 


G104C 


C-D loop 


110 


600 


V105C 


C-D Loop 


38 


33 


E107C 


C-D loop 


90,98,110 


160, 220, 180 


Mine 


C-D Loop 


40 


190 


R162C 


C-terminal region 


600 


4000 


S163C 


C-terminal region 


70,50, 88 


310,125,360 


K164C 


C-ter 


100 


600 


E165C 


C-ter 


43, 60, 51 


160, 220, 300 


♦166C 


C-terminus 


48.78, 96 


120, 300 



5 * IC50 values from individual experiments. A range is shown when N > 5. 
^ Commercial wild type rIFN-a2 (Endogen, Inc.) 
^ Wild type rIFN-a2 prepared by Bolder BioTechnology, Inc. 
^ Mutation creates a free cysteine (C98) in the C-helix 
^ Mutation creates a free cysteine (CI) in the N-terminal region 

10 

Example 7 
PEGylation of IFNhx2 Cysteine Muteins 
The purified IFN-a2 cysteine muteins can be PEGylated using the procedures described in 

15 Example 3 and PCT/US98/14497 and PCT/USOO/00931. A smaU-scale PEGylation experiment was 
performed with two of the purified rIFN-a2 cysteine muteins to identify conditions that allowed the proteins 
to be monoPEGylated at the free cysteine residue. Over-reduction of the protems was monitored by non- 
reducing SDS-PAGE, looking for a shift to a higher than expected apparent molecular weight as a result of 
protein unfolding, or for the appearance of multiple PEGylated species generated as the result of native 

20 disulfide reduction. One fig aliquots of purffied wild type and the rIFN-a2 muteins T106C and E107C were 
incubated for 1 hour with a 10-fold molar excess TCEP and a 20-fold molar excess of 5 kDA maleimide 
PEG at pH 8.5 at room temperature. After 60 mm, the reactions were stopped and unmediately analyzed by 
non-reducing SDS-PAGE. Both mutems yielded monoPEGylated protein under these conditions, based on 
SDS-PAGE analysis of the reaction mixtures. The apparent molecular weights of the monoPEGylated 

25 proteins were approximately 28 kDa by non-reducing SDS-PAGE. Wild-type rIFN-a2 showed no 
detectEd)le PEGylation under these conditions. Contol experiments indicated that the T106C and E107C 
cysteine muteins needed to be partially reduced with a reductant such as TC£P to be PEGylated. These data 
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indicate that the PEG molecule is attached to the cysteine residue introduced into the T106C and B107C 
proteins. 

Larger quantities of the IFN-a2 cysteine muteins can be modified with cysteine-reactive PEGs of 
various sizes and purified to obtain sufBcient material for bioactivity measurements. For purification of the 
5 PEGylated proteins, fte larger PEGylation reactions should be performed as described above for 1 hr at 
room temperature, diluted lOX with 20 mM MES, pH 5.0, adjusted to pH 3.0, and then loaded quickly onto 
an S-Sepbarose column using conditions similar to those described for initial purification of the rIFN-oc2 
muteins. The presence of the PEG moiety decreases the protein's afGnity for the resin, allowing the 
PEGylated protein to be separated from the non-PEGylated protein. The chromatogram from the S- 

10 Sepharose column should show two major protein peaks. The early eluting major peak (eluting at an NaCl 
concentration less than 230 mM) should be the mono-PEGylated IFN-a protein, which can be confirmed by 
non-reducing SDS-PAGE analysis. The apparent molecular weight of monoPEGylated IFN-a2 that has 
been modified with, a 5 kDa cysteine-reactive PEG is approximately 28 kDa by SDS-PAGE. The later 
eluting major peak (eluting at approximately 230 mM NaCl) should be the unreacted IFN-a2 protein. 

15 Fractions from the early eluting peaks containing predominantly PEG-IFN-a2 can be pooled and used for 
bioactivity measurements. Biological activity of the purified PEG-IFN-a2 proteins can be measured in the 
Daudi cell assay described in PCT/US00/0093L Concentrations of the proteins can be determined using a 
Bradford dye binding assay. In vivo biological activities of the PEGylated IFN-Gt2 cysteine muteins can be 
determined as described in PCTAJS98/14497 and PCTAJS/USOO/0093 1 . 

20 For PEGylation of the Q5C mutein, the purified protein was diluted to 100 |ig/ml protein with 100 

mM Tris, pH 8. A 15-fold excess of 5 kDa- maleunide PEG is added followed by 10-15-fold molar excess 
of TCEP. EDTA was also added (0.5 mM final concentration) to inhibit disulfide formation once the 
protein is partially reduced. The mixture was held at room temperature, 2 hours. An alternative method 
that also gave good PEGylation efficiency involved repeated additions of the PEG and TCEP reagents. We 

25 have found that 3 rounds of addmg lOX molar excess PEG reagent and lOX molar excess TCEP over a 
period of 2 hours gave greater than 80% PEGylation efficiency. This latter procedure of repeated additions 
of the PEG and TCEP reagents was used successfully to prepare Q5C modified with lOkDa-, 20kDa- and 
40kDa-PEGs. The PEGylated proteins were separated from unreacted Q5C startmg material and 
PEGylation reagents by ion-exchange chromatography using the S-Sepharose protocol described in 

30 Example 5. Alternative methods such as other ion exhangers (Q, DEAE, CM), HIC resins (Phenyl, Butyl) , 
a£Qnity columns , size exclusion columns, or chelating resins may be used to purify the PEGylated protein. 

Biological activity of the purified 10 kDa-, 20 kDa- and 40 kDa-PEG-Q5C {mteins were measured 
in the Daudi cell assay described in PCT/USOO/00931. Concentrations of flie proteins were determined 
using a Bradford dye binding assay. Mean ICsoS for flie 10 kDa-PEG-Qi5C, 20 kDa-PEG-Q5C, and 40 kDa- 

35 PEG-Q5C proteins were determined to be 70 pg/ml P^=2 assays), 100 pg/ml ^=8 assays), and 108 pg/ml 
(N~8 assays), respectively. 
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Example 8 

Cloning, Expression and Purification of Wild Type G-CSF and G-CSF (C17S) 
A. Cloning DNA sequences encoding GrCSF. A cDNA encoding G-CSF was amplified by PCR 
from total KNA isolated from the human bladder carcinoma cell line 5637 (American Type Culture 
5 Collection). The cells were grown in RPMI 1640 media supplemented with 10% FBS, SO imits/ml penicillin 
and 50 ^g/ml streptomycin. RNA was isolated from the cells usmg an RNeasy Mini RNA isolation kit 
purchased from Qiagen, Inc. (Santa Clarita, CA) foUowmg the manu&cturer*s directions. First strand 
synthesis of smgle-stranded cDNA was accon^>lished using a 1st Strand cDNA Synthesis Kit for RT-PCR 
(AMV) from Boehringer Mannheim Corp and random hexamers were used as the primer. Subsequent PCR 

10 reactions usmg the products of the first strand synthesis as template were carried out with forward primer 
BB91 (5>CGCAAGCTTGCCACCATGGCTGGACC TGCCACCCAG>3; SEQ ID N0:1) and reverse 
primer BB92 (5>CGCGGATCCTCCGGAGGGCTGGGCAAGGT GGCGTAG >3; SBQ ID N0:2). Primer 
BB91 anneals to the 5' end of the coding sequence for the G-CSF secretion signal and the reverse primer, 
BB92, anneals to the 3' end of the G-CSF coding sequence. The resulting 640 bp PCR product was 

15 digested with Hind EI and Bam HI, gel purified and cloned into pCDNA3.1(+) vector that had been 
digested with Hind HI and Bam HI, alkaline phosphatase treated, and gel purified. A clone with the correct 
DNA sequence (Souza et al., 1986; Nagata et al., 1986a,b) was designated pCDNA3.1(+)::G-CSFfus or 
pBBT165. 

PCR was used to modify this G-CSF clone for periplasmic and cytoplasmic expression in E, coli of 

20 wild type G-CSF (wild type) and a variant in which the naturally occuning free cysteine at position 17 was 
replaced by serine (C17S). The wild type G-CSF protein contains 5 cysteines, two of which participate m 
critical disulfide bonds and one free cysteine (C17) that is partially buried and not required for activity 
(Ishikawa et al., 1992, Kuga et al., 1989, Lu et al., 1992, Wmgfield et al., 1988). To avoid potential 
difficulties caused by the unpaired cysteme, we constructed a variant containing the Cys to Ser substitution 

25 at position 17 (C17S) as our platform molecule. All subsequent cysteine muteins were prepared with the 
C17S substitution present. G-CSF (C17S) has been reported to possess biological activity identical to wild 
type G-CSF (Ishikawa et al., 1992, Lu et al., 1992). 

Secreted G-CSF does not contam an added N-terminal methionine and has an amino acid sequence 
identical to naturally occurring G-CSF (Souza et al, 1986). In order to express a secreted form of G-CSF, 

30 PCR was used to fuse the leader sequence of the E, coli heat-stable enterotoxm (STII) gene (Picken et al, 
1983) to the coding sequence for mature G-CSF and a TAA stop codon was added following the carboxy- 
terminal residue, P174. At the same time, the aminoterminal portion of the G-CSF coding sequence was 
also modified. Codons for prolines at positions 2, 5, and 10 were all changed to CCG, and an Xlio I 
restriction site was introduced by changing the LI 8 codon from TTA to CTC in order to facilitate 

35 subsequent mutagenesis procedures. 

These constructions were carried out in parallel for the wild type and C17S genes and employed 
three sequential PCR reactions. For the C17S construct, the first reaction used forward primer BB116 (5> 
GGCCCGGCCAGCTCCCTGCCGCAGAGCTTCCTQCTGAAGAGCCTCGAG 

CAAGTGCGTAAGATCCAG>3; SEQ ID N0:3) and reverse primer BBli4 (5>CGCGAATTCTTAGGG 
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CTGGGCAAGGTGGCG >3; SEQ ID N0:4) and the cloned G-CSF cDNA as template. BBl 16 anneals to 
the S' end of the coding sequence of mature G-CSF and introduces the codon changes noted above at P2, 
PS, PIO, and L18 which do not change the amino acids encoded. It also introduces the C17S mutation 
(TGC ^AGC) and changes the leucine codon at position IS to the preferred CTG triplet. BBl 14 anneals 
S to the 3' end (18 bp) of the G-CSF coding sequence and introduces a TAA translational stop codon 
immediately following the the carboxy-temiinal residue, P174. BB114 also contains an Eco RI site for 
cloning purposes. For the wild type construct, the first reaction used forward primer BBl 17 <S> 
GGCCCGGCCAGCTCCCTGCCGCAGAGCTTCCTGCITAAGTGCCTCGAGCAAGT^ 
CAG >3; SEQ ID N0:5) and reverse primer BB114 (sequence above) with the cloned G-CSF cDNA as 

10 template. BB117 is identical to BBl 16 with two exceptions; the naturally occurring C17 codon, TGC, is 
present and the LIS codon used is CTT. This CTT creates an 4/7 11 restriction site in order to provide a 
rapid and convenient method for distinguishing wild type C17 clones from the C17S variant. The C17S 
clones carry the CTG codon at position 15 and therefore lack the Afl H rsetriction site. The ~530 bp PGR 
product from each of these reactions was gel purified and used as template for the second PCR reaction. 

15 For the second reaction each of the ~530 bp gel purified products was amplified with forward 

primer BB115 (5> ATGTTCGTTTTCTCTATCGCTACCAACGCGTACGCAACCCCGCTG 
GGCCCGGCCAGCTCCCTG >3; SEQ ID N0:6) and reverse primer BBl 14 (described above). The 3' 
portion (27 nucleotides) of BBl 15 anneals to the 5' end of the modified coding sequence of mature G-CSF 
which is identical in both the wild type and C17S PCR products. The 5' segment (36 nucleotides) of BBl 15 

20 encodes a portion of the STII leader peptide. The ~550 bp PCR products of each of these secondary 
reactions were gel purified and used as template for the third and final round of PCR. 

In the third reaction each of the ^SSO bp gel purified products was amplified with forward primer 
BBll (5>CCCCCTCTAGACATATGAAGAAGAACATCGCATTCCTGCTGGCATCTATGTTCGT 
TTTCTCTATCG > 3; SEQ ID N0:7) and reverse primer BBl 14 (described above). BBll adds the 

25 remainder of the STII leader peptide and contains an Nde I site overlapping the initiator ATG of the STII 
leader as well as an Xba I site for cloning purposes. The ~620 bp products of the these reactions were 
digested with Eco RI and Xba I and cloned into similarly digested plasmid vector pBC-SK(+) (Stratagene) 
for sequencing. 

For the wild type construct, one clone, designated pBBT187, was found to contain the correct 
30 sequence for the 620 bp Nde I - Eco RI segment containing the STE-G-CSF coding sequence. This 
firagment was then subcloned into {Nde I + Eco RI) cut expression vector pCYBl (New England BioLabs). 
The resulting plasmid was termed pBBT188. For the C17S construct, none of three clones sequenced was 
found to contain the correct sequence; all had one or more errors. One clone contained a single raissense 
mutation at the AlO position of the STII leader; the rest of the sequence of the 620 bp Nde I - Eco RI 
35 segment was correct. In vitro recombination between this clone and plasmid pBBT188 was used to generate 
a STn-G-CSF(C17S) construct of the correct sequence in pCYBl. pBBT188 and the C17S clone 
containing the single missense mutation at the AlO position of the STII leader, were bodi digested with Bsi 
WI and Eco RI. The only Eco RI site present in either plasmid is that v4iich follows the G-CSF translational 
stop codon. Bsi WI ^so cuts only once at a site within the coding sequence t>f the STII leader peptide, 7 bp 
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from the begiiming of the mature G-CSF coding sequence. Therefore by replacing the --535 bp Bsi WI - 
Eco RI fragment of pBBTlSS with the -535 bp Bsi WI - Eco RI fragment havmg the correct C17S construct 
sequence, we generated a pCYBl derivate to that expressed the STn-G-CSF(C17S) coding sequence. This 
plasmid was designated pBBT223 
5 For cytoplasmic expression m £. coU the cloned STH-G-CSF wild type and STII-G-CSF(C17S) 

genes were modified by PGR to eluninate the STII leader sequences and add an initiator methionine codon 
(ATG) immediately preceding the codon of the amino-terminal ammo acid (Ti) of mature G-CSF. The 
sequence-verified STU-G-CSF wild type and STII-G-CSF(C17S) clones were amplified with primers BB166 
(5> CGCCATATGACCCCGCTGGGCCCGGCCAG>3; SEQ ID N0:8) and BB114 (described above). 

10 BB166 anneals to the 5' end of the coding sequence of mature G-CSF and encodes an initiator methionine 
preceding the first amino acid of mature G-CSF. An Nde I site, which overlaps the ATG was included for 
cloning purposes. The -540 bp products of these PGR reactions were digested with Nde I plus Aat II, which 
cuts --400 bp downstream of the Nde I site. These -400 bp fragments were gel purified and cloned into 
pBBT187, the pBC-SK(+)::STn-G-CSF construct described above, which had been cut with Nde I plus Aat 

15 n, treated with alkaline phosphatase and gel purified. One Met-G-CSF wild type and one Met-G-CSF(C17S) 
clone were sequenced and both were found to contain the correct sequences. These Met-G-CSF wild type 
and Met-G-CSF(C17S) genes were subcloned as Nde I - Eco RI fragments into Nde I - Eco RI cut expression 
vector pCYBl, which is described above. The resulting plasmids were designated: pBBT225 
pCYBl::Met-G-CSF and pBBT226 = pCYBl::Met-G-CSF(C17S). 

20 B. Expression of Wild Type G-CSF and G-CSF (C17S) in R colL pBBT225, which encodes 

Met-G-CSF wild type, pBBT226 which encodes Met-G-CSF(C17S) and the pCYBl parent vector, were 
transformed into E. coli JM109. Experunents with these strams resulted in expression of the G-CSF 
proteins. Secreted G-CSF, both wild type and C17S forms, are preferable because they lack the non-natural 
methionine residue at the N-terminus of cytoplasmically-expressed Met-G-CSF proteins. 

25 For expression of secreted G-CSF, pBBT188 [pCYBl::STn-G-CSF3, pBBT223 [pCYBl::STn-G- 

CSF(C17S)] and the parental vector pCYBl were transformed mto E, coli W3110. The resulting strains 
were designated as BOB130: W3110(pCYBl), BOB213: W3110(pBBT188), and BOB268: 
W31 10(pBBT223). In preliminary screening experiments, strains were grown overnight in Luria Broth (LB 
media) contaming 100 ^g/ml ampicillin at 37**C in roll tubes. Saturated overnight cultures were diluted to ~ 

30 0.025 O.D. at Asoo in LB containing 100 ^g/ml ampicillin and incubated at 28, 37 or 42''C m shake flasks. 
Typically a 25 ml culture was grown in a 250 ml shake flask. When culture O.D.s reached -0.3 - 0,5, IPTG 
was added to a final concentration of 0.5 mM to induce expression of G-CSF. For initial experiments, 
cultures were sampled at 0, 1, 3, 5 and -16 h post-induction. Samples of induced and uninduced cultures 
were analyzed by SDS-polyacrylamide gel electrophoresis (SDS-PAGE) on precast 14% Tris-glycine 

35 polyacrylamide gels and stained with Coomassie Blue. Induced cultures of both BOB213 (wild type) and 
BOB268 (C17S) showed a band at approximately 19 kDA, which is consistent with the mature G-CSF 
molecular weight. This band was not detected in the uninduced cultures of BOB213 and BOB268 or in 
mduced or uninduced cultures of 6OB130, the vector-only control. Western blot analyses showed that &is 
~19 kDa band in BOB213 and BOB268 lysates reacted stit>ngly with an anti-human G-CSF antiserum 
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(R&D Systems). This antibody did not recognize proteins in uninduced cultures of BOB213 and BOB 268 
or in induced or uniduced cultures of BOB 130, tiie vector only control. These Western blots also showed 
that this ~19 kDa band co-migrated with a commercial human G-CSF standard purchased from R&D 
Systems. This result suggests that the STII leader peptide has been removed, which is consistent with the 
S protein having been secreted to the periplasm. N-terminal sequencing studies presented in Example 10 
indicate the STII signal sequence was properly processed. 

The 16 hour post-mduction sanqples from 28°C and 37°C cultures also were subjected to osmotic 
shock based on the procedure of Koshland and Botstein (1980). This procedure ruptures the E. coli outer 
membrane and releases the contents of the periplasm into the surrounding medium. Subsequent 

10 centrifugation separates the soluble periplasmic components (recovered in the supernatant) from 
cytoplasmic, insoluble periplasmic, and cell-associated components (recovered in the pellet). At both 
temperatures, some of the G-CSF protein synthesized, for both wild type, by BOB213, and C17S by 
BOB268 was recovered in the supernatant, but the bulk of the G-CSF proteins remained associated with the 
pellet. This indicates that while the protein appears to be processed and secreted to the periplasm, it is 

15 accumulated there primarily in an insoluble form. 

The preliminary screen of expression conditions for G-CSF wild type and the C17S variant showed 
that both proteins were relatively well expressed under a variety of conditions. For large scale expression 
and purification cultures were grown at 28^C and induced for ~16 hours. 

C. Purification of WUd Type G-CSF and G-CSF (CITS). Wild type and G-CSF (C17S) were 

20 expressed and purified at a larger scale using identical protocols. Fresh saturated ovemight cultures of 
BOB213 (wild type) and BOB268 (C17S) were inoculated at - 0.05 OD @ Aeoo in LB containing 100 fig / 
ml anq)icillin. Typically, 400 ml cultures were grown in a 2L baflfled shake flask at 28**C in a gyrotory 
shaker water bath at 250 ipm. When cultures reached a density of - 0.5 - 0.7 OD, IPTG was added to a 
final concentration of 0.5 mM. The induced cultures were then incubated overnight for ^16 h. The cells 

25 were pelleted by centrifugation and frozen at -80° C. Cell pellets were thawed and treated with 5 mL of B- 
PBR ™ bacterial protem extraction reagent according to the manufecturer's (Pierce) protocols. The 
insoluble material, which contained the bulk of the G-CSF protein, was recovered by centrifugation and 
resuspended in B-PER. This mixture was treated with lyso^ane (200 fig/mL) for 10 min to further disrupt 
the cell walls, and MgCl2 (10 mM final concentration) and protease-free DNAse <2 fig/ml) weie added. 

30 Insoluble G-CSF was collected by centrifugation and washed, by resuspension in water and recentrifugation, 
to remove most of the solubilized cell debris. The resulting pellet containing insoluble G-CSF was dissolved 
in 20 ml of 8 M urea, 25 mM cysteine in 20 mM Tris Base. This mixture was stirred for 30 min at room 
temperature then diluted into 100 ml of 40 mM sodium phosphate, 40 copper sulfate, 15% glycerol, pH 
8.0. This refold mixture was held at 4**C for 2 days. The pH of the refold mixture was then adjusted to 4.0 

35 with dilute HCl and the mixture was centrifuged before being loaded onto a 5 ml S-Sepharose column 
(Pharmacia HiTrap) equilibrated in 40 mM sodium phosphate pH 4.0 (Buffer A). The bound proteins were 
eluted widi a linear salt gradient from 0-100% Buffer B (500 mM NaCl, 40 mM sodium phosphate, pH 4.0). 
Wild type G-CSF and G-CSF (C17S) eluted from the S-Sepharose column as single major peaks at a salt 
concentation of approximately 300-325 mM NaCl, Column fractions were analyzed by non-reducing SDS- 
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PAGE. Fractions containing G-CSF and no visible impurities were pooled. The final yields of G-CSF wild 
type and G-CSF (C17S), as determined by Bradford analysis, were about 1.1 mg and 3.3 mg, respectively 
from 400 ml of culture. Purified wild type G-CSF and G-CSF (C17S) comigrated under reducing and non- 
reducing conditions of SDS-PAGE. The apparent molecular weights of reduced and non-reduced G-CSF 
5 and G-CSF (C17S) are approximately 19 and 17 kDa, respectively. 

D. In Vitro Bioactivities of Wild Type G-CSF and G-CSF (C17S). A cell proliferation assay 
using the murine NFS60 cell line was developed to measure bioactivities of wild type G-CSF and G-CSF 
(C17S). The NFS60 cell line was obtained from Dr. J. Ihle of the University of Tennessee Medical School, 
Memphis Tennessee, This cell line proliferates in response to human or mouse G-CSF or IL-3 (Weinstein et 

10 al., 1986). The ceils were maintained in RPMI 1640 media supplemented with 10% FBS, 50 units/ml 
penicillin, 50 ng/ml streptomycin and 17-170 units/ml mouse IL-3 (R&D Systems). Bioassays were 
performed in cell maintenance media minus IL-3. In general, the bioassays were set up by washing the 
NFS60 cells three times with RPMI media (no additives) and resuspending the cells at a concentration of 
0.5-lxloVml in cell maintenance media minus IL-3. Fifty ^1 (2.5-5x10^ cells) of the cell suspension was 

15 aliquotted per test well of a flat bottom 96 well tissue culture plate. Serial dilutions of the protein samples to 
be tested were prepared in maintenance media minus rL-3. Serial dilutions of recombinant human G-CSF 
(E. co//-expressed; R&D Systems) were analyzed in parallel. Fifty ^1 of the diluted protein sanq)les were 
added to the test wells and the plates incubated at 37°C in a humidified 5% CO2 tissue culture incubator. 
Protein samples were assayed in triplicate wells. After approximately 48-72 h, 20 jil of CellTiter 96 

20 AQueous One Solution (Promega Corporation) was added to each well and the plates incubated at 37°C in 
the tissue culture incubator for 1-4 h. Absorbance of the wells was read at 490 nm using a microplate reader. 
Control wells contained media but no cells. Mean absorbance values for the triplicate control wells were 
subtracted from mean values obtained for the test wells. EC50S, the concentration at half maximal 
stimulation, were calculated for each sample. 

25 The NFS60 cell line shows a strong proliferative response to G-CSF, as evidenced by a dose- 

dependent increase in cell number and absorbance values. Commercial G-CSF and G-CSF prepared by us 
had mean EC50S of 19 and 10 pg/ml, respectively, in flie bibassay (Table 6). Unexpectedly, G-CSF (C17S) 
had a mean EC50 of 7 pg/ml and was reproducibly 1.5-to 2-fold more potent than our wild type G-CSF 
standard and -S-fold more potent than tiie commercial wild type G-CSF standard in the bioassay (Table 3). 

30 The superior activity of G-CSF (C17S) was surprising because others have reported that wild type G-CSF 
and G-CSF (C17S) have identical activities (Lu et al., 1992). 

Example 9 

Construction, Expression, Purification and Bioactivity of G-CSF (C17S) Cysteine Muteins 
35 A. Construction of G-CSF Cysteine Muteins. 

Fifteen mutant G-CSF genes were constructed using site-directed PCR-based mutagenesis 
procedures sunilar to those described in PCT/USOO/00931 and Innis et al. (1990) and White (1993). We 
constructed five muteins in the amino-terminal region proximal to Helix A [*-lC (the addition of a cysteine 
residue onto the natural amino terminus), TIC, L3C, A6C and S7C]; two muteins in the B-C loop [E93C 
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and S96q; six muteins in the C-D loop [A129C, T133C, A136, A139C, A141C and S142C]; and two 
muteins in the carboxy-terminal region distal to Helix D [Q173C and *175C (the addition of a cysteine 
residue to the natural carboxy-terminus)]. The G-CSF cysteine mutems were all constructed in the C17S 
background to avoid potential difficulties and/or ambiguities that might be caused by the unpaired cysteine 
5 normally present at position 17 in wild type G-CSF. G-CSF (C17S) had previously been reported to possess 
full biological activity (Ishikawa et al, 1992; Lu et aL, 1992) and in our E. coli secretion system we find that 
the yields of purified C17S are higher than that of purified wild type G-CSF. In addition, in the in vitro 
assay our recombinant C17S is more active than wild type G-CSF produced by us and a second E. coli- 
produced recombinant wild type G-CSF obtained from a commercial vendor (R&D Systems, Inc.). 

10 The template used for the mutagenic PGR reactions was plasmid pBBT227 in which the STII-G- 

CSF (C17S) gene from pBBT223 (described in Example 8) was cloned as an Nde I - Eco RI fragment into 
iVi/e I-iJco RI cut pUC18. PCR products were digested with appropriate restriction endonucleases, gel- 
purified and ligated with pBBT227 vector DNA that had been cut with those same restriction enzymes, 
alkaline phosphatase treated, and gel-purified, Transformants from these ligations were grown up and 

15 plasmid DNAs isolated and sequenced. The sequence of the entire cloned mutagenized PCR fragment was 
determined to verify the presence of the mutation of interest, and the absence of any additional mutations 
that potentially could be introduced by the PCR reaction or by the synthetic oligonucleotide primers. 

The cysteine substitution mutation L3C was constructed as follows. The mutagenic forward 
oligonucleotide BB172 (5> ACCAACGCGTACGCAACCCCGTGTGGCCCGGCCAGC >3; SEQ ID 

20 N0:9) was designed to change the codon CTG for leucine at position 3 of mature G-CSF to a TGT 
encoding cysteine and to span the nearby Mlu I site. This oligo was used in PCR with the reverse, non- 
rautagenic, primer BB188 (5> GCCATCGCCCTGGATCTTACG >3; SEQ ID NO: 10) which anneals to 
DNA sequences encoding amino acid residues 21 -27 of mature G-CSF in pBBT227. A 100 nl PCR 
reaction was performed in IX Promega PCR buffer containing 1.5 mM MgCt, each primer at 0.4 |liM, each 

25 of dATP, dGTP, dTTP and dCTP at 200 ^M, 3 ng of template plasmid pBBT227 (described above), 2.5 
units of Taqi Polymerase (Promega), and 0.5 units of Pfu Polymerase (Stratagene). The reaction was 
performed in a Perkin-Elmer GeneAn?)® PCR System 2400 thermal cycler. The reaction program entailed: 
96**C for 3 minutes, 25 cycles of {95** C for 60 seconds, SV" C for 30 seconds, 72° C for 45 seconds] and a 
hold at 4^C. A 10 jxl aliquot of the PCR reaction was analyzed by agarose gel electrophoresis and found to 

30 produce a single fragment of the expected size ~ 100 bp. The remainder of the reaction was "cleaned up" 
using the QIAqiiick PCR Purification Kit (Qiagen) according to the vendor protocol and digested with Mlu I 
and Xho I (New England BioLabs) according to the vendor protocols. Following an additional clean step 
using the QIAquick PCR Purification Kit, tiie digestion products w«:e ligated with pBBT227 tiiat had been 
cut with Mlu I and Xho I, treated wifli calf intestinal alkaline phosphatase (New England BioLabs) and gel 

35 purified. The ligation reaction was used to transform E. <oli and plasmids from resulting transformants were 
sequenced. A clone having flie L3C mutation and tte correct sequence throughout the ^70 bp Mlu I - Xho I 
segment was identifiied. 

Hie substitution mutation TIC was -constructed and sequence verified using the irotocols detailed 
above for L3C with tiae following differrace. The mutagenic oligonucleotide BB171 (5> ACCAACGCG 
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TACGCATGCCCGCTGGGCCCGGCCAGC >3; SEQ ID N0:1 1), which changes the ACC codon for Tl 
to a TGC codon for cysteine and spans the nearby Mlu I site, was used in the PGR reaction in place of 
BB172. 

The substitution mutation Q173C was constructed and sequence verified using the protocols 
5 detailed above for L3C with tiie following dififerences. The mutagenic reverse oligonucleotide BB185 (5> 
CGCGA ATTC TTAGGGACAGGCAAGGTGGCG >3; SEQ ID N0:12), which changes the CAG codon 
for Q173 to a TGT codon for cysteme and spans the nearby Eco RI site, was used in the PGR reaction in 
place of BB172; The forward , non-mutagenic, primer BB187 (5> GCCATCGCCCTGGATCTTACG >3; 
SEQ ID NO: 13) which anneals to the DNA sequence encoding amino acid residues 78 - 84 of mature G- 

10 CSF m pBBT227 was used in place of BB188. A 10 ^1 aliquot of the PGR reaction was analyzed by 
agarose gel electrophoresis and found to produce a single fragment of the expected size 300 bp. The 
remainder of the reaction was "cleaned up" using the QIAquick PGR Purification (Qiagen) accordmg to the 
vendor protocol and digested with Sty I and Eco RI (New England BioLabs) according to the veiidor 
protocols. Following an additional clean \xp step using the QIAquick PGR Purification Kit, the digestion 

15 products were run out on a 1.5 % agarose gel and the -220 hp Sty I - Eco RI fragment of mterest was gel 
purified using a QIAquick Gel Extraction Kit (Qiagen) according to the vendor protocol. The gel purified 
fragment was ligated witii pBBT227 that had been cut with Siy I and Eco RI, treated with calf intestinal 
alkaline phosphatase (New England BioLabs) and gel purified. The ligation reaction was used to transform 
E. coll and plasmids from resulting transfonnants were sequenced. A clone having the Q173C mutation and 

20 the correct sequence throughout the -220 bp Sty 1- Eco RI segment was identified. 

A mutation was also constructed that added a cysteine following the carboxyterminal amino acid of 
the G-CSF coding sequence. This mutant, termed *175C was constructed using the protocols described 
above for the construction of the Q173C mutant with the following differences. The mutagenic 
oligonucleotide BB186 (5> CGCGAATTCTTAACAGGGCTGGGCAAGGTGGCGTAG >3; SEQ ID 

25 NO: 14), which inserts the a TGT codon for cysteine between the CCC codon for P174 and a TAA stop 
codon and spans the nearby Eco RI site, was used in the PGR reaction in place of BB185. 

The substitution mutation A6C was constructed using the technique of "mutagenesis by overlap 
extension" as described in Horton et al. (1993) and PCT/USOO/00931. The initial, or "primary" PGR 
reactions for the A6C construction were performed in a 50 jil reaction volume in IX Promega PGR buffer 

30 containing 1,5 mM MgCh , each primer at 0.4 \M, each of dATP, dGTP, dTTP and dCTP at 200 \M, I ng 
of template plasmid pBBT227, 1.5 units of Taq Polymerase (Promega), and 0.25 units of Pfti Polymerase 
(Stratagene). The reactions were performed in a Perkm-Ehner GeneAmp® PGR System 2400 thermal 
cycler. The reaction program entailed: 96**C for 3 minutes, 25 cycles of [95** C for 60 seconds, 60** C for 30 
seconds, 72° C for 45 seconds] and a hold at 4**C. The primer pairs used were [BB173 x BB188] and 

35 [BB174 X BB125]. BB188 (5> GCCATCGCCGTGGATCTT ACG >3; SEQ ID NO: 10) anneals to DNA 
sequences encoding amino acid residues 21 -27 of mature G-CSF in pBBT227. BB125 (5> CTATGC 
GGCATCAGAGCAGATA >3; SEQ ID NO: 17) anneals to the pUClS vector sequence -20 bp upstream 
of the cloned G-CSF sequence. BB173 and BB174 are complementary mutagenic oligonucleotides that 
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change the GCC codon for A6 to a TGC codon for cysteine. The sequence of BB173 is (5> 
CCGCTCXKJCCCGTGCAGCTCCCTGCCG >3; SEQ ID N0:15) and the sequence of BB174 is (5> 
CGGCAGGGAGCTGCACGGGCCCAGCGG >3; SEQ ID NO: 16). The PGR products were run out on a 
2% agarose gel, which showed that the [BB173 x BB188] and [BB174 x BB125] PGR reactions gave 
5 products of the expected sizes: -80 bp for. [BB173 x BB188] and -140 bp for [BB174 x BB125]. These 
fragments were excised from the gel, pooled, and eluted together from the agarose gel slices usmg a 
QIAquick Gel Extraction Kit (Qiagen) according to the vendor protocol and recovered in 30 |iil 10 mM Tris- 
HCl (pH 8.5). These two mutagenized fragments were then "spliced" together in the subsequent, or 
"secondary" PGR reaction. In this reaction 3^1 of of the gel-purified PGR products of the primary reactions 

10 were used as ten^)late and BB125 and BB188 were used as primers. The reaction volume was 100 ^1 and 
2.5 units of Taq Polymerase and 0.5 units of Pfu Polymerase were enq)loyed. Otherwise, the reaction 
conditions were identical to those used in the primary reactions. An aliquot of the secondary PGR was 
analyzed by agarose, gel electrophoresis and the expected band of ~190 bp was observed The bulk of the 
secondary PCai reaction was "cleaned up" using the QIAquick PGR Purification <Qiagen), digested with 

15 Nde I and Xlio I (New England BioLabs) according to the vendor protocols. Following an additional clean 
IQ) using the QIAquick PGR Purification Kit, the digestion products were ligated with pBBT227 that had 
been cut with Nde I and Xho I, treated with calf intestinal alkaline phosphatase (New England BioLabs) and 
gel purified. The ligation reaction was used to transform E. coli and plasmids from resulting transformants 
were sequenced to identify a clone containing the A6C mutation and having the correct sequence throughout 

20 the -130 bp We I -A7?ol segment. 

The substitution mutation S7C was constructed and sequence verified using the protocols detailed 
above for A6C with the foUowmg differences. Gomplementaiy mutagenic primers BB175 (5> 
CTGGGCCCGGCCTGCTCCCTGCCGCAG >3; SEQ ED NO: 18) and BB176 (5> 
CTGCGGCAGGGAGCAGGCCGGGCCCAG >3; SEQ ID NO: 19), which change the AGC codon for 87 

25 to a TGC codon for cysteine, replaced BB 173 and BB 174 respectively in the primary PGR reactions. 

A mutation that added a cysteine codon prior to the codon for the amino-termiiial residue, Tl, of 
mature G-CSF was constructed and sequence-verified. This mutation, termed *-lC was constructed usmg 
the protocol described above for construction of A6G with the following differences. Complementary 
mutagenic primere BB206 (5> AAGCCGTAGGGATGTAGCGGGGTGGGC >3; SEQ ID NO:20) and 

30 BB207 (5> GCC CAGGGGGGTACATGGGTAGGCGTT >3; SEQ ID N0:21), which insert a TGC codon 
for cysteme between the GGA codon for the carboxyterminal residue of the STII leader sequence and the 
AGC codon for the ammo-termmal residue of mature G-GSF in pBBT227, replaced BB173 and BB174 
respectively m the primary PGR reactions. The primary PGR reactions were performed in a 20 ^1 reaction 
volume. Each primer was present at 0.5 pM. The reaction included 0.5 ng of template plasmid pBBT227, 2 

35 units of Taq Polymerase, and 0.25 units of Pfu Polymerase. The reaction program entailed: 95**G for 3 
minutes, 25 cycles of [94** C for 60 seconds, 60** G for 30 seconds, 72** C for 45 seconds] and a hold at 4'*C. 
The products of the primary reactions were loaded direcdy onto a preparative 2 % agarose gel The primary 
reactions gave products of the expected si2^: -100 bp for [BB206 x BB188] and -125 bp for [BB2i07 x 
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BB125]. la the secondary PCR, the reaction volume was 100 jil, 5 id of the gel-purified PGR products of 
the primary reactions used as template, BB187 and BB126 were used as primers, and 4 units of Taq 
Polymerase and 0.25 units of Pfu Polymerase were employed. Otherwise, the reaction conditions were 
identical to those used in the primary reactions. 
5 The substitution mutation A129C was constructed and sequence verified using the protocols 

detailed above for A6C with the following dififerences. The primary PGR reactions en5>Ioyed pruner pairs 
[BB177 X BB126] and [BB178 x BB187]. The reverse, non-mutagenic primer BB126 {5> 
TGTGGAATTGTGAGCGGATAAC >3; SEQ ID NO:22) anneals to tixe pUC18 vector sequence --40 bp 
downstream of the cloned G-CSF sequence. The forward, non-mutagenic, primer BB187 (5> 

10 GCCATCGCCCTGGATCTTACG >3; SEQ ID NO: 13) anneals to the DNA sequence encoding amino acid 
residues 78 - 84 of mature G-CSF in pBBT227. BB177 and BB178 are complementary mutagenic 
oligonucleotides that change the GCC codon for A129C to a TGC codon for cysteine. The sequence of 
BB177 is (5> GGAATGGCCCCTTGCCTGCAGCCCACC >3; SEQ ID NO:23) and the sequence of 
BB178 is (5> GGTGGGCTGCAGGCAAGGGGCCATTCC >3; SEQ ID NO:24). The products of tiie 

15 primary reactions gave products of the expected sizes: -220 bp for [BB177 x BB126] and ~170 bp for 
[BB178 X BB187]. The secondary PCR employed BB187 and BB126 as primers and produced a product of 
tiie expected size: -360 bp. This product was digested with Sty I and Eco RI (New England BioLabs) 
according to the vendor protocols. Following an additional clean up using the QIAquick PCR Purification 
Kit, the digestion products were ligated with pBBT227 that had been cut with Sty I and Eco RI, treated with 

20 calf mtestinal alkalme phosphatase (New England BioLabs) and gel purified. The ligation reaction was 
vised to transform E. coli and plasmids from resulting transformants were sequenced to identify a clone 
containing the A129C mutation and havmg the correct sequence throughout the ~230 bp Sty I - Eco RI 
segment 

The substitution mutation T133C was constructed and sequence verified using tiie protocols 
25 detailed above for A129C with the following differences. Complementary mutagenic primers BB179 (5> 
GCCCTGCAGCCCTGCCAGGGTGCCATG >3; SEQ ID NO:2i5) and BB180 (5> 
CATGGCACCCTGGCAGGGCTGCAG GGC >3; SEQ ID NO:26), which change the ACC codon for 
T133 to a TGC codon for cysteuie, replaced BB173 and BB174 respectively m tiie prunary PCR reactions. 
The products of the primary reactions gave products of the expected sizes: '^205 bp for |iBB179 x BB126] 
30 and-180bpfor[BB180xBB187]. 

The substitution mutation A139C was constructed and sequence verified using the protocols 
detailed above for A129C wifli tiie following differences. Complementary mutagenic primers BB181 (5> 
GGTGCCATGCCGTGCTTCGCCTCTGCT >3; SEQ ID NO:27) and BB182 (5> 
AGCAGAGGCGAAGCACGGCATGGCACC >3; SEQ ID NO:28), which change tiie GCC codon for 
35 A139 to a TGC codon for cysteine, replaced BB173 and BB174 tespectively in the primary PCR reactions. 
The products of the primary reactions gave products of the expected sizes: '-185 1^ for (BB181 x BB126] 
and --200 bp for {BB182 x BB187]. 

The substitution mutation S142C was constructed and sequence verified using the protocols 
detailed above for A129C with the followmg differences. Complementary mutagenic primers BB183 (5> 
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CCGCTCCTTCGCCTGTGCTTTCCAGCGC >3; SEQ E> NO:29) and BB184 (5> 
GCGCTGGAAAGCACAGGCGAAGGCCGG >3; SEQ ID NO:30), which change the TCT codon for 
S142 to a TGT codon for cysteine, replaced BB173 and BB174 respectively in the pmnaiy PGR reactions. 
The products of the primary reactions gave products of the expected sizes: '^ISO bp for [BB183 x BB126] 
5 and -210 bp for [BB184 x BB187]. 

The substitution mutation A136C was constructed and sequence verified using the protocols 
detailed above for AI29C with the following differences. Conq)lementary mutagenic primers BB224 (5> 
CCCACCCAGGGTTGCATGCCGGCCTTC >3; SEQ ID N0:31) and BB225 (5> 
GAAGGCCGGCATGCAACCCTGGGTGGG >3; SEQ ID NO:32), which change the GCC codon for 

10 A136 to a TGC codon for cysteine, replaced BB173 and BB174 respectively in the primary PGR reactions. 
The primary PGR reactions were performed in a 20 ^il reaction volume. Each primer was present at 0.5 pM. 
The reaction included 0.5 ng of template plasmid pBBT227, 2 units of Taq Polymerase, and 0.25 units of 
P& Polymerase. The reactions were performed a Perkin-EImer GeneAmp® PGR System 2400 thermal 
cycler. The reaction program entailed: 95°C for 3 minutes, 25 cycles of [94** C for 60 seconds, 60° C for 30 

15 seconds, 72° C for 45 seconds] and a hold at 4°G. The products of the primary reactions were loaded 
directly onto a preparative 2 % agarose gel. The primary reactions gave products of the e;q)ected sizes: ~195 
bp for [BB224 x BB126] and -190 bp for [BB225 x BB187]. In the secondary PGR, the reaction volume 
was 100 (il, 5 ^1 of the gel-purified PGR products of the primary reactions were used as template, BB187 
and BB126 were used as primers, and 4 units of Taq Polymerase and 0.25 units of Pfii Polymerase were 

20 enq}loyed. Otherwise, the reaction conditions were identical to those used in the primary reactions. 

The substitution mutation A141G was constructed and sequence verified using the protocols 
detailed above for A136C with the following differences. Complementary mutagenic primers BB226 (5> 
ATGCCGGCCTTCTGCTCTGCTTTCCAG >3; SEQ ID NO:33) and BB227 
(5>CTGGAAAGCAGAGCAGAAGGCCGGCAT >3; SEQ ID NO:34), which change the GCC codon for 

25 A141 to a TGC codon for cysteine, replaced BB224 and BB225 respectively in the primary PGR reactions. 
The products of the primary reactions gave products of the expected sizes: -^180 bp for {BB226 x BB126] 
and -205 bp for [BB227 x BB187]. 

The substitution mutation E93C was <:onstructed using the technique of "mutagenesis by overlap 
extension". The primary PGR reactions for the E93C construction were performed in a 20 |il reaction 

30 volume in IX Promega PGR buffer contaming 1.5 mM MgCl2, each primer at 0.5 |jM, each of dATP, 
dGTP, dTTP and dCTP at 200 ^M, 0.5 ng of template plasmid pBBT227, 2 units of Taq Polymerase 
(Promega), and 0.25 imits of Pfu Polymerase (Stratagene). The reactions were performed in a Perkin-Elmer 
GeneAmp® PGR System 2400 thermal cycler. The reaction program entailed: 95°C for 3 minutes, 25 
cycles of [94° C for 60 seconds, 60° G for 30 seconds, 72** C for 45 in seconds] and a hold at 4*'C. The 

35 primer pairs used were [BB218 x BB2I1] and {BB219 x BB210]. The reverse, non-mutagenic primer 
BB211 (5> GGCCATTCCGAGTTCTTCCAT >3; SEQ ID NO:35) anneals to DNA sequences encoding 
amino acid residues 121-127 of mature G-CSF in pBBT227. The forward, non-mutagenic primer BB210 
(5> TTC GTTTTCTGTATGGGTAGGAAG >3; SEQ ID NO:36) anneals to DNA sequences encoding 
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amino acid residues 13 - 20 of the STII leader peptide in pBBT227. BB218 and BB219 are 
complementary mutagenic oligonucleotides that change the GAA codon for E93 to a TGT codon for 
cysteine. The sequence of BB218 is (5> CTGCAGGCCCTGTGTGGGATCTCCCCC >3; SEQ ID 
NO:37) and the sequence of BB219 is (5> GGGGGAGATCCCACACAGGGCCTGCAG >3; SEQ ID 
5 NO:38). The products of the primary reactions were loaded directly onto a preparative 2 % agarose gel 
which showed that PGR reactions gave products of the expected sizes: ~1 15 bp for [BB218 x BB21 1] and 
-325 bp for [BB219 x BB210]. These fragments were excised from the gel, pooled, and elutcd together 
&om the agarose gel slices using a QIAquick Gel Extraction Kit (Qiagen) according to the vendor protocol 
and recovered in 30 ^il 10 mM Tris-HCl (pH 8.5). In the secondaiy PGR reaction, 5 ^1 of the pool of gel- 

10 purified PGR products of the primary reactions was used as template and BB21 1 and BB210 were used as 
primers. The reaction volume was 100 ^1 and 4 units of Taq Polymerase and 0.25 units of Pfu Polymerase 
were enq)loyed Otherwise, the reaction conditions were identical to those used m the primary reactions. 
An aliquot of titie secondary PGR was analyzed by agarose gel electrophoresis, and the expected band of 
-415 bp was observed. The bulk of the secondary PGR reaction was "cleaned up" using the QIAquick PGR 

15 Purification (Qiagen) and digested with Sty I and Xho I (New England BioLabs) according to the vendor 
protocols. Following an additional clean up using the QIAquick PGR Purification Kit, the digestion 
products were ligated with pBBT227 that had been cut with Sty I and Xho I, treated with calf intestinal 
alkaline phosphatase (New England BioLabs) and gel purified. The ligation reaction was used to transform 
E, coli and plasmids from resulting transformants were sequenced to identify a clone containing the E93C 

20 mutation and having the correct sequence throughout the -260 bp I - Xho I segment. 

The substitution mutation S96C was constructed and sequence verified using the protocols detailed 
above for E93C with the following differences. Complementary mutagenic primers BB220 (5>GTG GAA 
GGG ATC TGC CCC GAG TTG GGT >3; SEQ ID NO:39) and BB221 (5> ACC GAA CTC GGG GCA 
GAT CCC TTG GAG >3; SEQ ID NO:40), which change the TCC codon for S96 to a TGC codon for 

25 cysteine, replaced BB218 and BB219 respectively in the primary PGR reactions. The products of the 
primary reactions gave products of the expected sizes: ~110 bp for [BB220 x BB211] and '-330 bp for 
[BB221xBB210], 

For e3q)ression in E, coli as protems secreted to the periplasmic space, the STH-G-CSF (G17S) 
genes encoding the muteins were excised from tiie pUC18-based pBBT227 derivatives as Nde l- EcoBI 

30 fragments of -600 bp, subcloned into the pCYBl expression vector, and transformed into E. coli W31 10. 

Usmg procedures similar to those described here, one can construct other cysteine mutems of G- 
GSF and G-GSF (C17S). The cysteine muteins can be substituticm mutations that substitute cysteine for a 
natural amino residue in the G-GSF coding sequence, insertion mutations that insert a cysteme residue 
between two naturally occurring amino acids in the G-GSF codmg sequence, or addition mutations that add 

35 a cysteme residue preceding the first amino acid, Tl, of the G-GSF coding sequence or add a cysteine 
residue following the terminal amino acid residue, P174, of the G-GSF codmg sequence. The cysteine 
residues can be substituted for any amino acid, or inserted between any two amino acids, anywhere in the G- 
GSF coding sequence. Preferred sites for substituting or inserting cysteine residues in G-CSF are in the 
region preceding Helix A, the A-B loop, the B-G loop, the C-D loop, and the region distal to Helix D. Other 
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preferred sites are the first or last three amino acids of the A, B, C, and D Helices. In addition to the 
mutations described above, oUbsx preferred residues in these regions for creating cysteme substitutions are 
P2, G4, P5, SB, L9, PIO, Qll, S12, T38, K40, S53, G55, 156, W58, A59, P60, L61, S62, S63, P65, S66, 
Q67, A68, Q70, A72, Q90, A91, L92, G94, 195, S96, E98, GlOO, G125, M126, A127, Q131, Q134, G135, 

5 S142, A143, Q145, and P174. All of the variants described in this Example are provided m the context of 
tiie natural protem sequence or a variant protein in which flie naturally occurring "free" cysteine residue 
(cysteine-17) has been changed to another amino acid, preferably serine or alanine. 

One also can construct G-CSF and G-CSF (C17S) muteins containing a free cysteine by 
substituting another amino acid for one of the naturally occurring cysteine residues in G-CSF that normally 

10 forms a disulfide bond. The naturally occurring cysteine residue that normally forms a disulfide bond with 
the substituted cysteine residue is now free. The cysteine residue can be replaced with any of the other 19 
amino acids, but preferably with a serine or alanine residue. These variants are provided in the context of the 
natural protein sequence or a variant protein in which the naturally occurring "free" cysteine residue 
(cysteine-17) has been changed to another amino acid, preferably serine or alanine. A free cysteine residue 

15 also can be introduced into G-CSF by chemical modification of a naturally occurring amino acid using 
procedures such as those described by Sjrtkowski et al. (1998). 

Using procedures shnilar to those described m Examples 8, 9, 10, 11 and 13 , one can express flie 
protems m E. coli, purify the proteins, PEGylate the proteins and measure then: bioactivities in in vitro and 
in vivo bioassays. The protems can be expressed cytoplasmically m E, coli or as proteins secreted to the 

20 periplasmic space. The mutems also can be expressed in eukaryotic cells such as insect or mammalian 
cells, usmg procedures similar to those described in PCT/USOO/00931, or related procedures well known to 
those skilled in the art. If secretion from eukaryotic cells is desired, the natural G-CSF signal sequence, or 
another signal sequence, can be used to secrete the protems from eukaryotic cells. 

B. Expression and Purification of G-CSF (C17S) Cysteine Muteins. E. coli strains expressing 
25 13 G-CSF (C17S) muteins (*-lC, TIC, L3C, A6C. S7C, E93C, A129C, T133C, A136C, A139C, A141C, 

Q173C, and *175C) were grown, induced and harvested using the protocols described m Example 8 that 
were employed for BOB213 (wild type) and BOB268 (C17S). All of the muteins were largely insoluble. 
The muteins were refolded and purified using the protocols described in Example 8 for G-CSF wild type 
and G-CSF (C17S). Non-reducing SDS-PAGE analysis revealed that the 13 purified cysteine muteins were 

30 recovered predominantly as monomers, migrating at approximately 17 kDa. The purified muteins 
comigrated with wild type G-CSF and G-CSF (C17S), with the exception of the *-lC mutein, which 
migrated slightly slower than wild type G-CSF. All but one of the muteins eluted from the ion-exchange 
column at a salt concentration similar to wild type G-CSF and G-CSF (C17S). The one exception, E93C, 
eluted later durmg the gradient (NaCl concentration of approximately 400 mM), possibly due to the 

35 substitution of cysteine for die charged amino acid, glutamic acid. 

C. Bioactivities of G-CSF (C17S) Cysteine Muteins. The 13 purified G-CSF tC17S) cysteine 
muteins were assayed in the NFS60 cell proliferation assay described in Example 8. Protein concentrations 
were determmed using a Bradford protein assay kit (Bio-Rad Laboratories). Commercial wild type G-CSF 
and wild t>^ G-CSF and G-CSF (C17S) iwrepar^d by us were analyzed m parallel on the smne days to 
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control for interday variability in the assays. All 13 muteins stimulated proliferation of the NFS60 cells to 
the same extent as the mid type G-CSF control proteins, within the error of the assay. Mean EC50S for the 
13 muteins ranged from 5-9 pg/ml. Mean EC50S for the cysteine muteins were similar to the mean ECso of 
the G-CSF (C17S) control protein and 1.5 to 2- fold lower , i.e., more potent, than the mean ECso for our 
wild type G-CSF control protein and 3-fold lower than the mean EC50 for the conmiercial wild type G- 
CSF protein. These data are summarized in Table 6, 

Table 6 

Bioactivities of Wild Type G-CSF, G-CSF (C17S) and G-CSF (C17S) Cysteine Muteins 



G-CSF 
Protein 


Mutation Location 


Mean ECso 
(pg/ml) 


ECso Range' 
(pg/ml) 


R&D G-CSF ^ 




18.6+/- 6.6 


12-35 (N=12) 


BBT G-CSF ^ 




10.2+/- 1.6 


8.5-13 (N=8) 


G-CSF (C17S) 




7.2+/- 2.0 


5-12 (N=18) 


MC/C17S 


N-terminus 


7.0 


5.8,6.0, 7.5, 8.5 


TIC/ " 


N-terminus 


7.8 


4.5,5.0,9.0, 10 


L3C/ " 


Proximal to A Helix 


8.0 


4.5, 7.5. 9.0, 9.0. 10 


A6C/ " 


Proximal to A Helix 


8.2 


4.5, 9.0, 11 


S7C/ " 


Proximal to A Helix 


7.3 


3.8, 8.0, 10 


E93C/ " 


B-C loop 


7.6 


6.5, 7.5. 8.0, 8.5 


A129C/ " 


C-D loop 


6.0 


6.0. 6.0. 6.0 


T133C/ " 


C-D loop 


6.6 


5.0, 6.0. 6.5. 7.5. 8.0 


A136C/ " 


C-D loop 


8.3 


7.0. 7.5, 8.5, 10 


A139C/ " 


C-D loop 


5.2 


5.0, 5.0. 5.5 


A141C/ " 


C-D loop 


8.9 


7.5, 8.5, 9.5, 10 


Q173C/ " 


Distal to D Helix 


6.2 +/- 1.3 


5.2-9.0 (N=7) 


*175C/ " 


C-teiminus 


5.6 


5.0. 5.5. 5.5. 6.0. 6.0 



^ Commercial wild type G-CSF (R&D Systems) 

^ Wild type G-CSF prepared by Bolder BioTechnology, Inc. 

15 D. Construction of G-CSF double cysteine mutants 

Multiple mutants containing two or more added free cysteine residues can be constructed either by 
sequential rounds of mutagenesis using the procedures described in Examples 9, 14 and 15, or alternatively 
by in vitro recombination of individual mutants to construct recombinant expression plasmids encoding 
muteins containing two or more free cysteine residues. The preferred multiple mutants would be those that 

20 combined two or more cysteine muteins that eadi retained high activity when PEGylated. Examples would 
be L3C plus T133C, L3C plus *175C, and T133C and *175C. Other preferred multiple mutants can be 
deduced based on the data from Table 3 and Table 4, and would include combinations containing two or 
more mutations selected from tiie group consisting of L3C, T133C, A141C and *175C. 

We constructed the following G-CSF double cysteine mutants: L3C/T133C, L3C/*175C, and 

25 T133C/*175C. To produce L3C/T133C, the L3C derivative of pBBT227 (G-CSF C17S in pUC18) was 
digested with Xho I and EcoR I, and treated with Calf Intestine Alkaline Phosphatase. The DNA was 
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extracted using the Qiagen PGR cleanup kit, and is called G-CSF L3C X-Rl-Cip vector. Next, the T133C 
derivative of pBBT227 was digested with Xho I and EcdK I, and the -480bp fragment was gel purified and 
ligated with the G-CSF L3C X-Rl-Cip vector. £. coli JM109 was transformed with the ligation reaction 
and clones having the correct sequence were identified. 
5 To produce L3C/*175C, the *175C derivative of pBBT227 was digested with Xho I and EcdR. I, 

and the ~480bp fragment was gel purified and ligated with the G-CSF L3C X-Rl-Cip vector (see above). E. 
coli JM109 was transformed with the ligation reaction and clones havmg the correct sequence were 
identified. 

To produce T133C/*175C, the T133C derivative of pBBT227 served as template in a PGR 
10 reaction using the reverse mutagenic oligonucleotide primer BB186 (5 > CGC GAA TTC TTA ACA GGG 
CTG GGC AAG GTG GCG TAG > 3; SEQ ID N0:14) and the forward non-mutagenic oligonucleotide 
BB125, which anneals to pUClS vector sequences upstream of the G-CSF insert. The PGR was a 50 jil 
reaction performed in IX Promega PGR buffer containing 1.5 mM MgCb, each primer at 0.4 jiM, each of 
dATP, dGTP, dTTP and dGTP at 200 ^iM, 0.5 ng of template firagment, 1 unit of Taq Polymerase 
15 (Promega), and 0.1 unit of Pfix Polymerase (Stratagene). The reaction was performed in a Perkin-Ehner 
GeneAmp® PGR System 2400 thermal cycler. The reaction program entailed: 95°G for 5 minutes, 22 
cycles of [94° C for 30 seconds, 55°C for 30 seconds, 72°G for 45 seconds], a 7 min hold at 72°G and a 
hold at 4°G. Twenty |il of the PGR were analyzed by agarose gel electrophoresis, and the '-630bp fragment 
was isolated from the gel. This fragment was digested with Xho I and £coR I, extracted using the Qiagen 
20 PGR cleanup kit. This DNA was ligated to a vector prepared by digestmg the T133C derivative of 
pBBT227 with Xho I and EcoK I, treating with Galf Intestine AlkaUne Phosphatase and extracting using the 
Qiagen PGR cleaniq) kit. E. coli JM109 was transformed with the ligation reaction and clones havmg the 
correct sequence were identified. 

25 Example 10 

PEGylation, Purification and Bioactivity of G-CSF Cysteine Muteins 
A. Preliminary PEGylation studies. Initial PEGylation reaction conditions were determined 
using TIC as the test protem, TCEP [Tris (2-carboxyefliyl) phosphineJ-HCl as the reducing agent and 5kDa 
cysteine reactive PEGs from Shearwater Polymers, Inc. Over-reduction of the protem was monitored by 
30 non-reducing SDS-PAGE, looking for a shift to a higher than expected apparent molecular weight as a result 
of protein unfolding, or for the appearance of multiple PEGylated species generated as the result of native 
disulfide reduction. One |ig aliquots of purified TIC were incubated with increasing concentrations of 
TCEP at room temperature in 100 mM Tris, pH 8.5 in the presence of varying amounts of excess 5 IdDa 
maleimide-PEG or 5kDa vinylsuifone-PEG. After 60 min, the reactions were immediately analyzed by non- 
35 reducing SDS-PAGE. The amounts of TCEP and particular PEG reagent that yielded significant amounts of 
monoPEGylated TIG protein, without modifying wild type G-GSF, were used for further experiments. The 
titration experiments indicated that at pH 8.5, a 10-fold molar excess of TCEP and 20-fold excess of 5 kDa 
maleimide PEG yielded significant amounts of monoPEGylated TIC protein (apparent molecular weight of 
28 kDa by SDS-PAGE) without detectable di- or tri-PEGylated protein. Wild type G-CSF and G-GSF 
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(C17S) were not modified under identical PEGylation conditions. These reaction conditions were used to 
scale up the PEGylation of the other G-CSF muteins. Control experiments indicated diat the TIC protein 
needed to be partially reduced by treatment with a reductant such as TCEP in order to be PEGylated. 

B. Preparation and Purification of PEGylated G-CSF Cysteine Muteins: Aliquots of 200 to 
S 300 of the 13 purified G-CSF cysteine muteins were PEGylated with a 5 kDa maleimide PEG to provide 

sufficient material for purification and characterization. The larger PEGylation reactions also were 
performed for 1 hr at room temperature, usmg the conditions described above. These reaction conditions 
yielded monoPEGylated protein for M of the muteins. Eleven of the monoPEGylated muteins have been 
purified using the procedure described below. At the end of the reaction time, the PEGylation mixture was 

10 diluted lOX with 40 mM sodium phosphate (monobasic) and the pH adjusted to 4.0 before being loaded 
quickly onto an S-Sepharose column (1 mL, HiTrap) using conditions similar to those described for the 
initial purification of the G-CSF muteins (20 mL gradient, 0-0.5 M NaCl in 40 mM sodium phosphate pH 
4). The presence of the PEG moiety decreased the protein's affinity for the resin, allowing the PEGylated 
protein to be separated fi*om the non-PEGylated protein. The chromatograms fi-om the S-Sepharose 

15 columns showed two major protein peaks eluting at approximately 275 mM NaCl and 300-325 mM NaCl 
for most muteins. The early eluting major peak was determined to be the mono-PEGylated G-CSF (C17S) 
mutein by SDS-PAGE. The later eluting major peak was determined to be the unreacted G-CSF {C17S) 
mutein. The PEG-E93C mutein eluted at about 325 mM NaCl versus about 400 mM NaCl for unreacted 
E93C protein. Fractions &om the early eluting peak containing predominantly the monoPEGylated G-CSF 

20 (C17S) mutein were pooled and used for bioactivity measurements. Five cysteine muteins (L3C, T133C, 
A141C, Q173C and *175C ) also were PEGylated usmg a 20 kDa PEG-maleimide and the PEGylation and 
piuification procedures described above. The 20 kDa-PEGylated proteins eluted firom the S-Sepharose 
column at approximately 250 mM NaCl. SDS-PAGE analyses mdicated that the purified PEGylated 
proteins contained less than 10%, and probably less than 5%, unPEGylated protein. The cysteine muteins 

25 needed to be partially reduced by treatment with a reductant such as TCEP in order to be PEGylated. Wild 
type G-CSF and G-CSF (C17S) did not PEGylate under identical partial reducing conditions, indicating that 
the PEG moiety is attached to the cysteine residue introduced into the muteins. 

C. Purification and PEGylation of the L3C G-CSF Cysteine Mutein: Time courses of the 
refold and the PEGylation reactions for L3C were performed. The refold for this particular mutein was 

30 found to be complete by 4 hours. The refold reaction progression was monitored by reverse phase HPLC 
(C4 column). Yields were ~10 mg/400 mL of culture grown as described in Example 8. Time courses were 
performed for the PEGylation of the L3C mutein with 10 kDa, 20 kDa and 40 kDa PEGs. PEGylation 
reaction conditions were as described above in Example 10, with the exeception that 0.5 mM EDTA was 
included in the PEGylation buffers. For 0.5 - 1 mg reactions, longer reactions times of 2-4 h at room 

35 temperature yielded greater amounts of PEGylated product. The efficiencies of PEGylation was -^0% with 
the extended time. Larger (up to 5 mg) PEGylation reactions were performed with equal efficiency. 
PEGylated protein was purified fi"om non-PEGylated protein on a 5 mL S-Sepharose column using the 
purification methodology previously described in Example 10. The 20 kDa PEGylated 
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protein eluted at -200 znM NaCl, while the 40 kDa-FEG protein and 10 kDa-PEG protein eluted at '^ISO 
mMand~220niM, respectively. The unPEGylated G-CSF L3C mutein eluted at -260 mM. The presence 
of EDTA significantly reduced the formation of protein dimers in the PEGylation reaction. 

D. N-tenninal sequencing of 20 kDa-PEG-L3C. The N-terminal amino acid of natural G-CSF is 
5 threonine (Souza et al., 1986). N-terminal sequencing of the purified 20 kDa-PEG-L3C protein using 

automated Edman degradation chemistry yielded the sequence TPXGPAS, which indicates that the N- 
terminus is correctly processed and is consistent with the third residue being PEGylated; PEGylated amino 
acids show up as blanks in sequencing runs, as indicated by the X.. 

E. Structural Determination of PEGylated G-CSF Cysteine Muteins by Circular Dichroism 
10 (CD) Analysis: CD analysis was performed on a Jasco 720 CD spectropolarimeter in a 1 cm pathlength 

300 ^iL cell at ambient temperature. Data were collected from 260 nm-200 nm at a sensitivity of 50m° and 
32 accumulations. Initial experimentation was performed with the L3C mutein and lOK PEG-L3C protein. 
Both had CD spectra very similar to that found in the literature for wild-type G-CSF. Similar analyses can 
be performed on other G-CSF cysteine muteins and their PEGylated derivatives. 

15 F. Bioactivities of PEGylated G-CSF (C17S) Cysteine Muteins: Biological activities of the 1 1 

purified 5 kDa- PEG-G-CSF (C17S) cysteine muteins and 5 purified 20 kDa-PEG-G-CSF (C17S) cysteine 
muteins were measured in the NFS60 cell proliferation assay described in Example 8. Concentrations of 
the proteins were determined using a Bradford dye binding assay. All of the PEGylated G-CSF (C17S) 
cysteine muteins showed similar dose-response curves and reached the same level of maximal growth 

20 stimulation as G-CSF (C17S), within the error of the assay. Mean EC50S for tiie 5 kDa-PEG modified 
cysteine mutems ranged from 2-11 pg/ml. These PEGylated muteins were 1.5- to 2-fold more potent than 
our wild type G-CSF and - 3-fold more potent than the commercial wild type G-CSF in the bioassay. Mean 
EC50S for the 20 kDa-modified cysteine muteins ranged from 9 to 14 pg/ml. Biological activities of the 
PEGylated G-CSF (C17S) cysteine muteins were equal to, or superior to, that of wild type G-CSF. All of the 

25 NFS60 cell stunulatory activity of 5 kDa-PEG-L3C could be abolished by a neutraUzing monoclonal 
antibody to G-CSF (R&D Systems, Inc.), indicating that the growth promoting activity is due to the PEG- 
L3C G-CSF protein and not to a contaminant in the protein preparation. The bioactivity data are 
summarized in Table 7. The ECso of L3C modified with a 40 kDa-PEG was detetmmed to be 30-50 pg/ml 
using the NFS60 cell proliferation assay. 

30 Biological activities of the PEGylated G-CSF (C17S) cysteine muteins described here are superior 

to the activities of previously described PEGylated G-CSF proteins, all of which have biological activities 
that are reduced relative to wild type G-CSF (Tanaka et al., 1991; Kinstler et al., 1996a; Bowen et aL, 
1999). Tanalca et al. (1991) reported that G-CSF modified with an amine-reactive 10 kDa NHS-PEG 
consisted of multiple molecular weight species and multiple isoforms modified at different lysine groups or 

35 the N-terminal amino acid. Biological activity of this NHS-PEG mixture was determined to be reduced 
approximately 3-fold relative to unmodified G-CSF (Tanaka et ah, 1991; Satake-Ishikawa et al., 1992). 
Bowen et al, (1999) reported that a G-CSF variant modified with 5 kDa-, 10 kDa- and 20 kDa-amine- 
reactive PEGs were reduced ^proximately 6-fold, 10-fold and 20-fold relative to unmodified G-CSF. 
Bowen et al, (1999) purified a single molecular weight species of the PEGylated G-CSF variant modified 
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with a 20 kDa-amine-reactive-PEG and found that its biological activity was reduced approximately 4-foid 
relative to unmodified G-CSF. Although the single molecular weight species isolated by Bowen et al. 
(1999) corresponded to the G-CSF variant modified with a single PEG molecule, the PEG-protein 
preparation was heterogeneous due to the PEG molecule bemg attached to the protein at multiple sites. 
5 Kmstier et al. (1996) purified a PEGylated Met-G-CSF species that is modified preferentially at the non- 
natural amino-terminal methionine residue of E, co//-expressed Met-G-CSF (cytoplasmically expressed) via 
amine or amide linkages. This PEGylated Met-G-CSF protein possessed only 68% of the in vitro 
bioactivity of wild type Met-G-CSF (Kinstier et al., 1996). 

10 Table 7 



Bioactivities of PEGylated G-CSF Cysteine Muteins 



G-CSF Protein 


ECjoS (pg/ml) 


SkDaPEG 


20kDaPEG 


Mean 


Range ' 


Mean 


Range' 


*-lC/C17S 


5.6 


5.5, 5.5, 5.5, 6.0 






TIC/ " 


7.0 


6.0,7.0,8.0 






L3C/ " 


5.5 . 


5.0, 5.3, 6.2 


8.8 


8.0. 8.0. 9.0, 10 


A6C/ " 


6.9 


6.0.6.0.7.5. 8.0 






S7C/ " 


2.4 


1.7.3.0 






E93C/ " 


1.9 


1.6.2.0,2.0,2.0 






A129C/ " 


7.1 


5.0,5.2.11 






T133C/ " 


7.4 


5.2,6.0,11 


9.0 


6.0,7.0.11. 12 


A136C/ " 


6.9 


6.0, 6.5, 6.5, 8.5 






A139C/ " 


6.8 


5.0.5.5, 10 






A141C/ " 


7.1 


6.5.7.0,7.0.8.0 


9.3 


6.0. 6.0. 12, 13 


0173C/ " 


7.0 


5.5. 5.5. 10 


11 


9.0, 10, 12, 13 


*175C/ " 


11 


10,11.12 


14 


12,12,16,16 



EC50 values from individual experiments 



15 Example 11 

Use of A Cysteine Blocking Agent Improves Recovery of Properly Folded G-CSF Cysteine Muteins 
Insoluble, E. co/i-expressed wild type G-CSF and G-CSF (C17S/Q173C) were refolded by 
procedures that varied the amount and type of reducing agent and the presence or absence of catalytic 
amounts of copper sulfate. 5 raM dithiothreitol (DTT) was chosen as the standard reducmg agent based on a 
20 literature reference that describes its use in an optimized refold protocol for G-CSF (Kuga et al;, 1 989). Lu 
et al. (1992) describes a protocol for refoldmg/renaturmg insoluble G-CSF that has no reducing agent 
present during the solubilization step but does contain 40 |iM copper sulfate in the renaturation buffers. 

E. coll cultures (400 mL) were grown and expression of each G-CSF protein was induced as 
described in Example 8. The cells were lysed and the insoluble portion was isolated by centrifiigation as 
25 described in Example 8. The msoluble material, which contained a majority of tiie insoluble G-CSF 
proteins, was suspended in 20 mL of 8 M urea, 20 mM Tris, pH 8 and stirred until homogeneous. The 
mixture was aliquotted into 6 tubes. 5 mM DTT or 2S mM cysteine were added to •certain of tiie tubes as 
described in Table 6. After one hour the solublization mixtures wer« diluted into 25 mL of 40 mM sodium 
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phosphate, 15% glycerol, pH 8 with and without 40 \M copper sulfate The refolds were allowed to sit at 
4**C for two days. At this time the pH of each was adjusted to 4. The refolds were centrifaged, the 
supematants loaded onto an S-Sepharose column and the G-CSF wild type and Q173C protems purified as 
described in Example 8. Column fractions were pooled based on non-reducing SDS-PAGB analysis, as 
5 described in Example 8. The amoimt of each protein recovered after chromatography is shown in Table 5. 

Table 8 

Recoveries of G-CSF Proteins Refolded/Renatured in the Presence 
And Absence of Different Reducing Agents 

10 



Refold 
Protocol 


Reducing Agent 


Copper 
Sulfate 


G-CSF (WT) 
Yield Oig)" 


G-CSF (C17S/Q173C) 
Yield (pg)" 


A 


None 


None 


49 


161 


B 


None 


40 mM 


24 


73 


C 


SmMDTT 


None 


17 


23 


D 


SmMDTT 


40 uM 


47 


53 


E 


25 mM cysteine 


None 


60 


243 


F 


25 mM cysteine 


40 nM 


80 


275 



^ Protein recovered from 67 ml of E. coli culture 



As shown in Table 8 the greatest yields of G-CSF wild type and the G-CSF cysteine mutein were 
achieved when cysteine was used as the reducing agent during the solubilization step. The presence of 

15 copper sulfate (40 jiM) appeared to marginally enhance recoveries when used in conjunction with a 
reducing agent Non-reducing SDS-PAGE analysis of wild type G-CSF proteins recovered using Refold 
. protocols A-F showed that each contained predominantly a single molecular weight species of the size 
expected for monoraeric G-CSF (approximately 17 kDa under non-reducing conditions). In contrast, when 
the S-Sepharose column pools from G-CSF (C17S/Q173C) Refolds A-D were analyzed by non-reducing 

20 SDS-PAGE, the final product band was broad and contained a number of different apparent molecular 
weight species in the monomeric range. Presumably the different molecular weight, monomeric species 
represent different disulfide isoforms of the G-CSF (C17S/Q173C) protein. The G-CSF (C17S/Q173C) 
protein recovered from refolds B and F ran as a single sharp band that comigrated with wild type G-CSF, 
indicating that a single, predominant folded species had been recovered. The data show that addition of 

25 cysteine during the solubilization and refolding steps significantly enhances the yield of properly folded G- 
CSF (C17S/Q173C:) protein. Although not wishing to be bound by any particular theory, we postulate that 
the added cysteine forms a mixed disulfide with the free cysteine residue in the mutein. The noixed disulfide 
limits possible disulfide rearrangments that could occur involving the fiee cysteme residue. (Cysteine may 
be more effective than DTT because DTT typically does not form mixed disulfides due to a 

30 thermodynamically preferred intramolecular bond that forms iq}on oxidation. 



SUBSTITUie SHEET (RULE 26) 



wo 01/87925 



PCT/USOl/16088 



48 

Exanqile 12 

Comparison of G*CSF Protein Stabilities Prepared in the Presence and Absence of Cysteine 
Wild type G-CSF and G-CSF (C17S/Q173C) proteins prepared as described in Example 11 using 
Refold procedure A (no reducing agent, no copper sul&te) and Refold procedure F (25 mM cysteine, 40 
5 copper sulfate) were placed at 50°C at pH 4 and pH 8. At times 0, 5 minutes, 30 minutes, 1, 2, 3, 4, S, and 
20 hours, tiie protein sanq)les were centrifuged to remove any denatured protein precipitates. Aliquots were 
removed from the siq>ematants and frozen. At the end of the e3q)eriment, all aliquots were analyzed by non- 
reducing SDS-FAGE to determine what portion of the original G-CSF protein sample remained in solution 
and was monomeric. Each protein's soluble half-life was determined based on relative band intensities as 
10 visualized on the gel. The results are shown in Table 9. 



Table 9 

Stabilities of G-CSF Proteins Prepared Using Different Refold/Renaturation Procedures 



Protein Sample 


pH 


Estimated Half-life 


G-CSF WT Refold A 


4 


3-4 hours 


G-CSF WT Refold F 


4 


3-4 hours 


G-CSF WT Refold A 


8 


~1 hour 


G-CSF WT Refold F 


8 


~1 hour 


G-CSF (C17S/0173C) Refold A 


4 


~30 minutes 


G-CSF (C17S/0173C) Refold F 


4 


> 20 hours 


G-CSF (C17S/0173C) Refold A 


8 


< 15 minutes 


G-CSF (C17S/0173C) Refold F 


8 


>20 hours 



15 

The results show that wild type G-CSF has a longer soluble half-life at pH 4 than at pH 8, \^ch is 
consistent with results previously reported by Arakawa ct al.(1993). The soluble half-life of wild type G- 
CSF was not substantially different whether the protein was refolded using Refold Procedure A or F. In 
contrast, G-CSF (C17S/QI73C) had a much longer soluble half-life when the protein was refolded using 

20 Procedure F (> 20 hours) rather than Procedure A (<30 minutes). Thus, m addition to increasing the 
recovery of properly folded G-CSF cysteine muteins, use of cysteine in the solubilization/refolding process 
increases the thermal stability of the final product. 

Additional studies can be performed to compare the stabilities of G-CSF cysteine muteins to wild 
type G-CSF. For example, a matrix of experiments can be performed by exposing the proteins to various 

25 pHs, temperatures and serum concentrations. At various time points, the intregrity of the proteins can be 
moniotered by assays such as, but not limited to, the NFS60 in vitro cell proliferation bioactivity assay 
described in Example 8, size exclusion chromatography. Circular Dichroism, ELISA assays and Westem 
blot analysis. 

30 Example 13 

In Vivo Efficacy of PEG-G-CSF Cysteine Muteins 

Groups of three male Sprague Davdey rate, weighing - 320g each, received a single intravenous 
injection (lateral tail vem) of wild tjpe r^onirinant G-CSF (prepared by Bolder BioTechnology), 
Neiq)ogcn® (a Fec<Hrf)inant G-CSF sold by Amgen, Inc.) or PEG-L3C at a dose of 100 ^ig/kg. Protein 
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concentrations were determined using a Bradford dye binding assay. At selected time points blood saniples 
(0.3 to 0.4 ml) were drawn &om die rats into EDTA anti-coagulant tubes. Aliquots of the blood samples 
weie sent to a commercial firm for a conq)lete blood cell (CBC) count. The remainder of the blood sample 
was centrifuged and the plasma frozen at -80°C. Blood samples were drawn at 0.25 , 1.5, 4, 8, 12, 16, 24, 
5 48, 72, 96, 120 and 144h post-injectioa A 0 h baseline sample was obtained - 24 h prior to injection of the 
test con5)Ounds. Tables 10 and 1 1 show the mean blood neutrophil and total white blood cell counts for the 
different test groups over time. All three test compounds stimulated an increase in peripheral white blood 
cells and neutrophils over baseline values. White blood cell and neutrophil counts for the test groups 
receiving wild type recombinant G-CSF and Neupogen® peaked - 24 h post-injection and returned to 

10 baseline values by 48 h. In contrast, white blood cell and neutrophil counts for the rats receiving PEG- 
L3C peaked -48-72 h post-injection and did not return to baseline values until - 120 h post-injection. Peak 
white blood cell and neutrophil levels observed in the rats receiving PEG-L3C were significantly higher than 
for the groups receiving wild type recombinant G-CSF or Neupogen® (p<0.05). The data indicate that 
PEG-L3C is capable of stimulating an increase in circulating neutrophil and white blood cells, and that the 

15 absolute increase in peripheral white blood cell counts and neutrophils is greater and longer lasting than that 
seen with wild type G-CSF or Neupogen®. Similar experiments can be performed to demonstrate efficacy 
of other PEGylated G-CSF cysteine muteins (C17 or C17S versions). Similar studies also can be performed 
using the subcutaneous route for administration of the proteins. 

20 Table 10 

Effects of G-CSF, Neupogen® and PEG-L3C on Neutrophil Blood Cell Counts FoUowing Single 
Intravenous Administration of the Proteins (100 iig/kg) 



Tiiiie(Hr) 




Neutrophils 
Mean+/-SE 
(cells/id blood) 






G-CSF" 


Neupogen 


PEG-L3C 


0 


1,147+/- 167 


1,906+/- 564 


1.596+/- 462 


4 


6.752+/- 923 


4,504+/- 549 


"4,237+/- 624 


8 


8.437 +/- 546 


5.525+/- 894 


"5,939+/- 664 


12 


10.744+/- 549 


11,891+/- 1,545 


"8,470+/- 833 


24 


11.035+/- 788 


11,148+/- 977 


"14,849+/- 1,398 


48 


2.355 +/- 218 


2,610+/- 245 


"-'18.488+/- 2.954 


72 


2.113+/- 438 


3.077+/- 590 


"••=17.353+/- 2.515 


96 


2.086+/- 496 


2,675+/- 673 


"•"5,467+/- 914 


120 


2,179+/- 373 


2,063+/- 469 


2,390+/- 238 



Wild type G-CSF prepared by Bolder BioTechnology, Inc. 
25 p< 0.05 versus 0 hour neutrophil levels 

^ p< 0.05 versus G-CSF and Neupogen at same tune point 
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Table 11 

Effects of G-CSF, Neupogen® and PEG-L3C on White Blood Cell Counts Following Single 



Intravenous Administration of the Proteins (100 us/kg) 



Tiine(Hr) 


White Blood Cells 
Mean+/-SE 
(cells/|xl blood) 




G-CSF' 


Neupogen 


PEG-L3C 


0 


11,100+/- 252 


11,100+/- 829 


12.900+/- 1,320 


4 


16,000+/- 1.059 


13,600+/- 570 


13,700+/- 1,923 


8 


15,200+/. 371 


14,900+/- 260 


13.800+/- 1,044 


12 


18,400+/- 240 


20,100+/- 674 


"16,700+/- 586 


24 


23,900+/- 1,110 


25,500 +/- 1,734 


"29.200+/- 2,321 


48 


14,700+/- 426 


15,300+/- 1,715 


"•■^ 37,400 +/- 4,971 


72 


15,300+/- 426 


14.800+/- 764 


"•'37,800+/- 4.715 


96 


14,200+/- 1,000 


14.700+/- 689 


"18,100+/- 2,550 


120 


11,000+/- 2,651 


11,300+/- 1,477 


13,800+/- 1,189 



* Wild type G-CSF prepared by Bolder BioTechnology, Inc. 
5 ^ p< 0.05 versus 0 hour white blood cell levels 

^ p< 0.05 versus G-CSF and Neupogen at same time point 



Plasma G-CSF and PEGylated G-CSF cysteine mutein protein levels can be quantitated using 
commercially available G-CSF ELISA kits (R & D Systems, Inc.). Titration experiments can be performed 
10 to determine the relative sensitivity of the EUSA for detecting wild type G-CSF, unmodified G-CSF 
cysteine muteins and PEGylated G-CSF cysteine muteins. Similar studies can be performed using the 
subcutaneous route of administration of the proteins. 

Plasma concentrations of the proteins from the efScacy experiment outlined above in Example 13 
were measured using human G-CSF ELISA kits purchased from R&D Systems, Inc. Results are shown in 
15 Table 12. The results mdicate that 20 kDa-PEG-L3C has a significantly longer circulating half-life than 
wild type G-CSF or Neupogen® following intravenous administration of the proteins to rats. 

Table 12 

Plasma concentrations of G*CSF, Neupogen® and 20 kDa-P£&L3C Following a Single Intravenous 
Administration of the Proteins (dose of 100 ^g/kg) 

20 



Time Post-injection 
(hour) 


G-CSF' 
(ng/ml) 


Neupogen 
(ng/ml) 


20 kDa-P£G-L3C 
(ng/ml) 




Mean+/-SJ>. 


Mean +/- S J>. 


Mean+/-S.D. 


0 


0+/-0 


0+/-0 


0+/-0 


0.25 


6,974+/- 1,809 


7,546+/- 486 


9,667+/- 1,382 


1.5 


1,866+/- 292 


2,083+/- 461 


8,368+/- 1,215 


4 


399+/- 73 


534+/- 131 


7,150+/- 892 


8 


101 +/- 21 


167+/- 26 


5,692+/- 1,094 


12 


14+/- 5 


26+/- 1.1 


4,165+/- 783 


16 


2+/-3 


2.9+/- 0.5 


3,669+/. 513 


24 


0.9+/- 0.3 


0.08 +/- 0.03 


2,416+/- 462 


48 


0.16 +7- 0.01 


0+/-0 


773+/- 137 


72 


0.08+/. 0.02 


0+/-0 


36+/- 36 
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Time Post-injection 
(hour) 


G-CSF' 
(ng/mi) 


Neupogen 
(ng/ml) 


20 ld)a-PEG-L3C 
(ng/ml) 




Mean +/- S.D. 


Mean+/-S.D. 


Mean +/- S.D, 


96 


0.11+/- 0,02 


0+/-0 


0.62+/- 0.13 


120 


0.05 +/- 0.02 


0+/-0 


0.15+/- 0,02 


144 


0.03 +/- 0.02 


0+/-0 


0.03 +/. 0.01 



' wad type G-CSF prepared by Bolder BioTechno: 



lOgy, Inc. 



In vivo efficacy of the PEGylated G-CSF cysteine muteins (C17 or C17S versions) can be 
measured in normal or neutropenic rodents such as mice or rats by demonstrating that the proteins stimulate 

5 increases in circulating neutrophil levels and granulopoiesis compared to vehicle-treated animals. G-CSF 
stimulates neutrophil levels in normal and neutropenic rodents at a dose of 100 \ig/kg (Kubota et al., 1990; 
Kang et al., 1995). For demonstratmg efficacy m normal mice, groups of 5 mice (wei^Jiing - 20 g each) 
can receive subcutaneous injections of G-CSF, PEG-G-CSF cysteine muteins or placebo (vehicle solution) 
at specified intervals for up to five days. Normal mice such as ICR mice can be purchased from a 

10 commercial vendor. On day 6 the animals can be sacrificed and blood samples collected for complete blood 
cell count (CBC) analysis. Hematopoietic tissues (liver and spleen) can be collected, weighed and fixed in 
formalin for histopathologic analyses to look for evidence of increased granulopoiesis. Bone marrow can be 
removed from various long bones and the sternum for xmit particle preps and histopathologic analysis to 
look for evidence of increased granulopoiesis. Comparisons between groups should be made using a 

15 Students T test for single comparisons and one-way analysis of variance for multiple comparisons. P< 0.05 
should be considered significant. The PEGylated G-CSF cysteine muteins should stimulate greater 
increases in circulating neutrophil levels and granulopoiesis in the mice compared to the vehicle-treated 
mice. Efficacy of the PEGylated G-CSF cysteine muteins modified with 5 kDa, 10 kDa, 20 WDa or 40 kDa 
PBGs can be tested when administered once, once per day, every other day, or every third day. In mitial 

20 experiments, different groups of mice can receive subcutaneous injections of 0.0032, 0.016, 0.08, 0.4 and 2 
^ig per injection of the PEGylated G-CSF cysteine muteins. Control mice can receive vehicle solution only. 
Additional control groups can receive wild t>pe G-CSF (2 fig/ every day (ED) for 5 days) and 2\ig wild type 
G-CSF using the same dosing regimen as the PEGylated G-CSF cysteine muteins. 

Efficacy of the PEGylated G-CSF cysteine muteins also can be demonstrated in neutropenic mice. 

25 Neutropenia can be induced by treatment with cyclophosphamide (CPA; 100 mg/kg), vAich is a commonly 
used myelosuppressive chemotherapeutic agent and relevant to tiie human clinical setting. G-CSF 
accelerates recovery of normal neutrophil levels in cyclophosphamide-treated animds (Kubota et al., 1990; 
Kang et al., 1995; Matsuzaki et al., 1996). Mice (-20g) can receive an intraperitoneal injection of 
cyclophosphamide on day 0 to induce neutropenia. The animals should be divided into different groups, 

30 which should receive subcutaneous injections of G-CSF, PEGylated G-CSF cysteine muteins or placebo at 
q)ecified intervals for up to five days. One control group should not receive cyclophosphamide but should 
receive placebo injections. Efficacy of the PEGylated G-CSF cysteine muteins modified with 5 kDa, 10 
kDa, 20 kDa or 40 kDa PEGs can be tested when administered once, every other day, or every third day. In 
initial cxp^im^ts, diflfepent groups of mice can receive subcutaneous injectiofls of 0.0032, 0.016, 0.08, 0.4 
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and 2 \ig per injection of the PEGylated G-CSF cysteine muteins. Control mice can receive vehicle solution 
only. Additional control groups can receive wild type G-CSF (2 \ig/ every day (ED) for 5 days) and 
2^g/injection of wild type G-CSF using the same dosing regimen as the PEGylated G-CSF cysteine muteins. 
On days 0-10, five mice per group can be sacrificed and blood and tissue samples analyzed as described for 
5 the normal mouse experiments above. The PEGylated G-CSF cysteine muteins should stimulate an 
accelerated increase in circulating neutrophil levels and granulopoiesis in the mice compared to the vehicle- 
injected, CPA-injected control group. 

Alternatively, efficacy of PEGylated G-CSF cysteine muteins can be demonstrated in neutropenia 
studies using a rat model. G-CSF accelerates the recovery of normal neutrophil levels in rafe treated with 

10 rayleosuppressive chemotherapeutic agents. In this case, groups of Spague Dawley rats (weighing '-SOOg 
each) can receive an intraperitoneal dose of CPA (100 mg/kg) at Day 0 to induce neutropenia. The animals 
can then be divided into three groups, those who receive subcutaneous injections of G-CSF, PEGylated G- 
CSF cysteine muteins or placebo at specified intervals for up to 10 days. One control group can receive 
placebo injections rather than cyclophosphamide. In initial experiments, efScacy of the PEGylated G-CSF 

15 cysteine muteins modified with 10 kDa, 20 kDa and 40 kDa PEGs can be measured by performing 
subcutaneous doses of -0.1 ^g-500 ^ig/kg (with the preferential range being 1-100 ^g/kg) when doses are 
administered once, every day, every other day or every third day. An additional control group can receive 
commercially available wild type G-CSF (100 ^g/kg) every day for S days and another control group can 
receive wild type G-CSF with the same dose and dosing regimen as with the PEGylated G-SCF cysteine 

20 mutants. Control rats can receive vehicle solution only. On days 0-6, 8, 10, 12, and 14 blood samples can 
be collected for CBC analysis. At the completion of die time course, die rats can be sacrificed for collection 
of the hematopietic tissues and bone marrow to investigate evidence of increased granulopoiesis. The 
PEGylated G-CSF cysteine mutants should stimulate an accelerated increase in circulating neutrophil levels 
and granulopoiesis in the rats conopared to the vehicle-injected, CPA mjected control group. 

25 

Example 14 

Cloning, Expression, Purification and Bioactivity of Wild Type GM-CSF 
A. Cloning DNA sequences encoding GM-CSF. We cloned and sequenced a cDNA encoding 
human GM-CSF by RT-PCR of total RNA isolated from the human bladder carcinoma cell line 5637 

30 (obtained from the American Type Culture Collection). A cDNA encoding G-CSF was amplified by PCR 
from total RNA isolated from the human bladder carcinoma cell line 5637 (American Type Culture 
Collection). The cells were grown in RPMI 1640 media supplemented with 10% FBS, 50 units/ml penicillin 
and 50 ^g/nfil streptomycin. RNA was isolated from the cells using an RNeasy Mini RNA isolation kit 
purchased from Qiagen, Inc. (Santa Clarita, CA) following the manufacturer's directions. First strand 

35 synthesis of single-stranded cDNA was accomplished using a 1st Strand cDNA Synthesis Kit for RT-PCR 
(AMV) from Boehringer Mannheim Corp and random hexamers were used as the primer. Subsequent PCR 
reactions using the products of the first strand synthesis as template were carried out with forward primer 
BB267 (5 > GAC ACT OCT GCT GAG ATG AAT G > 3; SEQ ff) NO:75) and reverse primer BB268 (5 > 
CTT GTA GTG GCT GGCCAT CAT G > 3; SEQ ID NO:76), Primer BB268 anneals to the 5' end of the 
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coding sequence for the GM^ilSF secretion signal and the reverse primer, BB268, anneals to the 3' end of 
the GM-CSF coding sequence. The resulting ~ 450 bp PGR product was digested with Hind UL and Bam 
HI, gel purified and cloned into pODNA3.1(+) vector that had been digested with Hind UL and Bam HI, 
alkaline phosphatase treated, and gel purified A clone with the correct DNA sequence was designated 
5 paDNA3.1(+)::GM-CSFfus or pBBT267, We used PGR to modify this GM-CSF clone for periplasmic 
e}qpression in £. coU, When expressed in E, coli, via secretion to the periplasm, GM-CSF does not contam 
an added N-terminal methionine and has an amino acid sequence identical to naturally occurring GM-CSF 
(Lee et aL, 1985). In order to express a secreted form of GM-CSF, PGR was used to fuse the leader 
sequence of the E. coli heat-stable enterotoxin (STII) gene (Picken et al., 1983), preceeded by an Nde I 

10 restiction site, to the amino-terminal coding sequence of mature GM-CSF. In addition, a TAA stop codon, 
followed immediately by an Eco RI restriction site, was added following the carboxy-terminal residue, 
E127. At the same time, codons for prolines at positions 2, 6, 8, 12, 1 17 and 124 were all changed to CCG, 
and the codon for leucine at position 114 was changed to CTG. The PCR reaction used forward primer 
BB300 (5> CGC AAC GCG TAG GCA GCA CCG GCC CGC TCG CCG AGC CCG AGC ACG CAG 

15 CCG TGG GAG >3; SEQ ID NO:77) and reverse primer BB301 (5> CGC GAA TTC TTA CTG CTG 
GAC CGG CTG CCA GCA GTC AAA CGG GAT GAC CAG CAG AAA >3; SEQ ID NO:78) with 
pBBT267 as ten^late. The resulting ~ 400 bp PGR product was digested with Mlu I and Eco RI, gel 
purified, and cloned into pBBT227 which is described m Example 9 above. pBBT227 DNA was digested 
with Mlu I and Eco RI, alkaline phosphatase treated, and run out on a 1% agarose gel. The ^ 2.4 kb vector 

20 fragment was purified and used in ligation. The resulting recombmants carry a complete stn leader fused to 
GM-CSF and this "stII-GM-CSF" construct can be excised as an Nde I - Eco RI firagmeat of 450 bp. One 
clone with the correct sequence was designated pUC18::stII-GM-CSF. For expression studies the Nde I - 
Eco RI fi:agment of this plasmid was subcloned into the expression vector pBBT257, v4iich is described in 
below. The resultmg plasmid, pBBT257; stll-muGM-GSF, or pBBT271 was introduced into E coli W3 1 10 

25 for expression. 

The plasmid pBBT257 was derived from the expression vector pCYB 1 (New ^gland BioLabs) by 
deleting the ampicillin resistance gene of pCYBl and replacing it with the gene for tetracycline resistance 
derived from the classic cloning vector pBR322 {Bolivar et al, 1977) In both pBBT257 and pCYBl, 
expression of the cloned gene is under the control of the tac promoter, which is regulated by the product of 

30 the plasmid-bome lacf gene* These vectors allow genes to be expressed as unfused proteins or as fiisions to 
a chitin binding domain; our constructs were created so that the proteins are expressed as unfused proteins. 
Plasmid pBBT257 was constructed as follows. The tetracycline resistance gene (Tc^ gene) of plasmid 
pBR322 (purchased from New England bioLabs) was amplified by PCR using primers BB228 (5> CGC 
GCT GCA GTT GTC ATG TTT GAC AGC TTA TCA TC >3; SEQ ID N0:41) and BB229 (5 > CGC 

35 GCT GCA G AT TTA AAT TAG CGA GGT GCC GCC GGC TTC CAT > 3; SEQ ID NO:42). Forward 
primer BB228 anneals to nucleotides I through 25 of the pBR322 sequence (GenBank Accession # 
J01749), which are located upstream of the Tc*^ gene and include the "-35" portion of the Tc^ gene 
promoter. Oligo BB228 contains an added Pst I site for cloning puiposes. The reverse primer BB229 
aimeals to nucleotides 1277 tiirough 1300, v^ch are located immediately downstream of the translational 
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stop codon that follows the coding sequence of the Tc^ gene. BB229 contains an added Dra I site for 
cloning purposes. The 40 |al PGR reaction was performed in SO vM KCl, 10 mM Tris-HCl (pH 9.0 @ 25'' 
C), 0.1% Triton® X-100, 1.5 mM MgCl2 and included dNTPs at 200 \M each, 20 pmole of each primer, 
0.5 ng of pBR322 DNA, 2.5 units of Taq polymerase (Promega), and 0.5 units of PFU polym^se 
5 (Stratagene). The PGR reaction consisted of: 95** C for 3 minutes, 25 cycles of [94*^ C for 30 seconds, 60° C 
for 30 seconds, 72° C for 90° seconds] followed by a 4° C hold. The resulting -1300 bp product was gel 
purified, digested with Pst I and Dra I and used in a ligation reaction as described below. Purified pCYBl 
DNA was digested with Pst I and Swal and treated with calf intestine alkaline phosphatase according to the 
vendor (New England BioLabs) protocols. Pst I and S)m I each cut the vector once and flank the ampicillin 

10 resistance (ApR) gene. The digestion products were cleaned up using a Qiaquick PGR Cleanup Kit 
(Qiagen) according to the vendor protocol and subsequently run out on a 1 % agarose gel. The -5.3 kb 
vector fragment, deleted for the Ap^ gene, was gel purified and ligated with the Pst I - Dra I cut PGR 
product containing the Tc^ gene. Both Dra I and Sw I generate blunt-ended digestion products that can be 
ligated together. The ligation reaction was used to transform E: coli DH5a and tetracycline-resistant 

15 transformants were selected. Three isolates were subsequently analyzed and all were found to be sensitive 
to ampicillin. Restriction endonuclease digestion products obtained from these isolates were also consistent 
with deletion of the -1500 bp Pst I and Swa I fragment containing the Ap^ gene and its replacement by the 
-1300 bp Pst I - Dra I fragment that carries the Tc^ gene. One isolate, designated pBBT257, was chosen 
for use in expression of recombinant proteins. 

20 B. Expression of Wild Type GM-CSF in R colL For expression of secreted GM-CSF, pBBT271 

[pBBT257::STII-GM-GSF] and the pBBT257 parent vector, were transfonned into coli W3110. The 
resulting strains were designated as BOB340: W3110(pBBT2S7) and BOB350: W3110(pBBT271). Fresh 
saturated overnight cultures were inoculated at 0.05 OD @ A^oo in LB containing 10 ^g / ml tetracycline. 
These 100 ml cultures were grown in a 500 niL baffled shake flask at 28^G in a gyrotory shaker water bath 

25 at -250 rpm. When the culture reached a density of - 0.6 OD, IPTG was added to a final concentration of 
0.5 mM and the induced cidture was then mcubated overnight for '^^16 h. San^les of induced and uninduced 
cultures were analyzed by SDS-polyacrylamide gel electrophoresis (SDS-PAGE) on precast 16% Tris- 
glycine polyacrylamide gels and stained with Goomassie Blue. The induced culture of BOB350XGM-GSF) 
gave a band at approximately 14 kDA, which is consistent with the mature GM-GSF molecular weight This 

30 band was not detected in an uninduced culture of BOB350 or in induced or unmduced cultures of BOB340, 
the vector-only control. Western blot analyses showed that this -14 kDa band reacted strongly with an anti- 
human GM-CSF antiserum (R&D Systems). This antiserum did not recognize proteins in uninduced 
cultures of BOB340 and BOB 350 or in the induced culture BOB340, the vector only control. These 
Western blots also showed that this -14 kDa band co-migrated with a commercial, E, co//-derived himian 

35 GM-CSF standard purchased from R&D Systems. This result suggests that the STII leader peptide has 
been removed, which is consistent with the protein having been secreted to the periplasm. N-terminal 
sequencing studies presented below indicate the STII signal sequence was properly processed. 

The 16 hour post-induction samples from these cultures also were subjected to osmotic shock 
based on the procedure of Koshland and Botstein (19S0). This procedure ruptures the E. coli outer 
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membrane and releases the contents of the periplasm into the surrounding medium. Subsequent 
centrifugation separates the soluble periplasmic conq)onents (recovered in the supernatant) firom 
cytoplasmic, insoluble periplasmic, and cell-associated conaponents (recovered in the pellet). Little of the 
GM-CSF protein synthesized was recovered in the supernatant The bulk of the GM-CSF remained 
5 associated with the pellet. This indicates that while the protein appears to be processed and secreted to the 
periplasm, it accumulates there primarily in an insoluble form. Similar results have been reported by others 
for GM-CSF secreted to the E coli periplasm (Libby et al., 1987; Greenberg et al, 1988). 

C. Purification of Wild Type GM-CSF. Wild type GM-CSF was expressed and purified at a 
larger scale using the following protocols. A fresh saturated overnight culture of BOB350 (wild type) was 

10 inoculated at - 0.05 CD @ A^oo in LB containing 10 ng / ml tetracycline. The 400 ml culture was grown in 
a 2L baffled shake flask at 28^C in a gyrotory shaker water bath at -250 rpm. When the culture reached a 
density of - 0.6 OD, IPTG was added to a final concentration of 0.5 mM. The induced culture was then 
incubated overnight for -16 h. The cells were pelleted by centrifugation and firozen at -80° C. The cell 
pellet was thawed and treated with 5 mL of B-PER ™ bacterial protein extraction reagent according to the 

15 manufacturer's (Pierce) protocols. The insoluble portion, and the bulk of the GM-CSF protein, was 
recovered by centrifiigation and resuspended in B-PER. This mixture was treated with lysozyme (200 
]igfmL) for 10 min to fiirther disrupt the cell walls, and MgCl2 (10 mM final) and protease-firee DNAse (2 
|ig/ml) were added. Insoluble GM-CSF was collected by centrifugation and washed, by resuspension in 
water and recentrifiigation, to remove most of the solubilized cell debris. For refolding, the resulting pellet 

20 containing insoluble GM-CSF was dissolved in 10 ml of 8 M urea, 25 mM cysteine in 20 mM Tris Base. 
This mixture was stirred for 30 min at room temperature then diluted into 100 ml of 20 mM Tris, 40 }3M 
copper sul&te, 15% glycerol, pH 8.0. This refold mixture was held at 4°C for 2 days and then centrifuged 
and loaded onto a 5 ml Q-Sepharose column (Pharmacia HiTrap) equilibrated in 20 mM Tris, pH 8.0 
(Buffer A). The bound proteins were eluted with a linear salt gradient ftom 0-35% Buffer B (IM NaCl, 20 

25 mM Tris, pH 8). Colunm firactions were analyzed by non-reducing SDS-PAGE. GM-CSF eluted at 
approximately 230 mM NaCl. Fractions containing primarily GM-CSF were pooled. 

The Q-Sepharose pool was diluted with an equal volume of 30% ammonium sulfate and wanned to 
room temperature before being loaded onto a 1 mL Phenyl HP column (Pharmacia HiTmp) previously 
equilibrated with 15% ammonium sulfate in 20 mM sodiinn phosphate, pH 7.5. Purified GM-CSF was 

30 recovered fi-om the column by elution with a reverse salt gradient (15% ammonium sulfate to 0% 
ammonium sulfate in 20 mM sodium phosphate, pH 7.5). The Phenyl HP column elution profile for GM- 
CSF showed a single major peak, eluting at approximately 6.5% ammonium sulfate. Column fractions 
across the peak were analyzed by non-reducing SDS-PAGE. Fractions containing GM-CSF and no visible 
contaminants were pooled. The final yield of wild type GM-CSF as determined by Bradford analysis, was 

35 about 2.6 mg from 400 ml of culture. N-terminal sequencing of wild type GM-CSF using automated Edman 
degradation chemistry yielded the sequence APARSPS, which identically matches tiie first seven amino 
acids of mature human GM<CSF, and indicates that the N-terminus is correctiy processed (Lee et aU 1985). 
Purified wild type GM-CSF and commerci^ly available GM-CSF (£. c^7//-^ressed; R&D Systems) co- 
migrated under reducing and non-reducing conditions as shown by Western blot analysis. Both proteins 
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exhibited the expected mobility shift to a higher apparent molecular weight under reducing conditions 
because of the disruption of the intramolecular disulfide bonds. 

In Vitro Bioactivities of Wild Type GM-CSF, A cell proliferation assay using the human TF-l 
eiythroleukemic cell line (Kitamura et ah, 1989) was developed to measure bioactivity of wild type GM- 
5 CSF. The human TF-l cell line was obtained from the American Type Culture Collection. The cells were 
maintained in RPMI 1640 media supplemented with 10% FBS, 50 units/ml penicillin, 50 fig/ml 
streptomycin and 2 ng/ml recombinant human GM-CSF (£. coW-derived; R&D Systems). In general, the 
bioassays were set up by washing the TF-l cells three times with RPMI 1640 media (no additives) and 
resuspending the cells at a concentration of IxloVml in RPMI 1640 media containing 10% FBS, 50 xmits/ml 

10 penicillin and 50 ^g/ml streptomycin (assay media). Fifty ^l (5x10^ cells) of the cell suspension was 
aliquotted per test well of a flat bottom 96 well tissue culture plate. Serial dilutions of the protein samples to 
be tested were prepared in assay media . Serial dilutions of commercial recombinant human GM-CSF {E, 
co/f-expressed; R&D Systems) were analyzed in parallel. Fifty |il of the diluted protein samples were added 
to the test wells and the plates incubated at 3/C m a humidified 5% CO2 tissue culture incubator. Protem 

15 samples were assayed in triplicate wells. After ^ 3 days, 20 ^1 of an MTS/PMS mixture <CellTiter 96 
AQueous One Solution, Promega) was added to each well and the plates uicubated at 37^C in the tissue 
culture incubator for 1-4 h. Absoibance of the wells was read at 490 nm usmg a microplate reader. Control 
wells contained media but no cells. Mean absorbance values for the triplicate control wells were subtracted 
&om mean values obtained for the test wells. ECsqS, the concentration at half maximal stimulation, were 

20 calculated for each sample to compare bioactivities of the proteins. 

The TF-l cell line shows a strong proliferative response to GM-CSF, as evidenced by a dose- 
dependent increase in cell number and absorbance values. Commercial GM-CSF and GM-CSF prepared by 
us had mean EC50S of 97 and 105 pg/ml, respectively, m the bioassay (Table 13). 

25 Example 15 

Construction, Expression, Purification and Bioactivity of GM-CSF Cysteine Muteins 
A. Construction of GM-CSF Cysteine Muteins. Thirteen mutant GM-CSF genes were 
constructed using site-directed PCR-based mutagenesis as described in general by Itmis et al., 1990) and 
Horton et all, (1993) and in the Example 9. We constructed five muteins in the amino-terminal region 

30 proximal to Helix A [*-lC (the addition of a cysteine residue onto the natural amino terminus), AlC, A3C, 
S5C and S7C]; one mutcm in the B-C loop fS69C]; three muteins in the C-D loop [E93C, T94C, and 
T102C]; and three muteins in the carboxy-terminal region distal to Helix D [V125C, Q126C and *128C (the 
addition of a cysteine residue to the natural carboxy-temiinus)]. We also constructed one mutein at a 
putative N-linked glycosylation site [N27C], which is located at the distal end of Helix A. The template 

35 used for the mutagenic PCR reactions was plasmid pBBT268 in v^ch the STII-GM-CSF gene is cloned as 
an Nde I - Eco RI fiagment in pUC18. PCR products were digested with appropriate restriction 
endonucleases, gel-purified and ligated with pBBT268 vector DNA that had been cut with those same 
restriction enzymes, alkaline phosphatase treated, and gel-purified. Transformants fix>m these ligations were 
grown up and plasmid DNAs isolated and sequ^ced. The sequence of the entire cloned mutagenized PCR 
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fragment was deterniined to verify the presence of the mutation of interest, and the absence of any additional 
mutations that potentially could be introduced by the PGR reaction or by the synthetic oligonucleotide 
primers. 

For expression in E, coli as proteins secreted to the periplasmic space, the STII-GM-CSF genes 
5 encodmg the 13 muteins were excised from the pUC18-based p6BT268 derivatives as Nde I - Eco RI 
fragments of -450 bp, subcloned into the pBBT257 expression vector, and transformed into E. coli W3 1 10. 

Using procedures similar to those described here, one can construct other cysteine muteins of GM- 
CSF. The cysteine muteins can be substitution mutations that substitute cysteine for a natural amino residue 
in the GM-CSF coding sequence, insertion mutations that insert a cysteine residue between two naturally 

10 occurrmg amino acids in the GM-CSF coding sequence, or addition mutations that add a cysteine residue 
preceding the first amino acid, Al, of the GM-CSF coding sequence or add a cysteine residue following the 
temiinal amino acid residue. El 27 , of the GM-CSF coding sequence. The cysteine residues can be 
substituted for any amino acid, or inserted between any two amino acids, anywhere in the GM-CSF coding 
sequence. Preferred sites for substituting or inserting cysteine residues in GM-CSF are in the region 

15 preceding Helix A, the A-B loop, &e B-C loop, the C-D loop, and the region distal to Helix D. Other 
preferred sites are the first or last three amnio acids of the A, B, C, and D Helices. Some preferred positions 
for cysteine mutations are described in Table 13. Other preferred positions include R67C, G68C, L70C, 
R30C, T32C, A33C, E35C, N37C, T39C, E45C, D48C, Q50C, E51C, Q99C, T98C, B113C and E127C. In 
addition to the mutations described above, otiier preferred residues in these regions for creatmg cysteine 

20 substitutions are described in PCT/US98/14497. 

One also can construct GM-CSF muteins containing a free cysteine by substituthig another amino 
acid for one of the naturally occurring cysteine residues in GM-CSF that normally forms a disulfide bond. 
The naturally occurring cysteine residue that normally forms a disulfide bond with the substituted cysteine 
residue is now free. The cysteine residue can be replaced with any of the other 19 amino acids, but 

25 preferably with a serine or alanine residue. A free cysteine residue also can be introduced into GM-CSF by 
chemical modification of a naturally occurring amino acid using procediures such as those described by 
Sytkowski et al (1998). 

Multiple mutants containing two or more added free cysteine residues can also be constructed 
either by sequential rounds of mutagenesis using the procedures described in Examples 8, 9, 14 and 15 or 

30 alternatively by in vitro recombination of individual mutants to construct recombinant expression plasmids 
encoding muteins containing two or more free cysteines. The preferred multiple mutants would be those 
that combined two or more cysteine muteins that each retain high activity when PEGylated for example A3C 
plus S69C, S69C plus E93C, and ABC plus E93C. Other preferred multiple mutants can be deduced based 
on the data from Table 9 and Table 10 and would include combinations containing two or more mutations 

35 from the group including *-lC, AlC, A3C, S5C, S7C, S69C and E93C. 

Using procedures similar to those described in Examples 14 « 16, one can express the proteins in 
E. colU purify the proteins, PEGylate the proteins and measure their bioactivities in an m vitro bloassay. 
The proteins can be expressed cytoplasmically in E. coli or as proteins secreted to the periplasmic space. 
The muteins also can be e3^ressed in eukaryotic cells such as insect or manmialian oells, usmg procedures 
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similar to those described in FCT/CJSOO/00931, or related procedures well known to those skilled in the art 
If secretion from eukaryotic cells is desired, the natural GM-CSF signal sequence, or another signal 
sequence, can be used to secrete the proteins from etikaryotic cells. 

B. Expression and Purification of GM-CSF Cysteine Muteins. coli strains expressing the 13 
GM-CSF cysteine muteins were grown, induced and harvested using the protocols described for wild type 
GM-CSF in Example 14. The muteins were refolded and purified using the protocols described for wild 
type GM-CSF in Example 14. The muteine eluted from the Q-Sepharose column at approximately 200-230 
mM NaCl and from from the Phenyl HP column at approximately 6-8% ammonium sulfate. The muteins 
were recovered predominantly as monomers, with apparent molecular weights of 14 kDa by non-reducing 
SDS-PAGE. 

C. Bioactivities of GM-CSF Cysteine Muteins. The 13 purified GM-CSF cysteine muteins were 
assayed in the TF-1 cell proliferation assay. Protein concentrations were determined using a Bradford 
protein assay kit (Bio-Rad Laboratories). Commercial wild type GM-CSF and wild type GM-CSF prepared 
by us were analyzed in parallel on the same days to control for interday variability in the assays. All 13 
muteins stimulated proliferation of the TF-1 cells to the same extent as the wild type GM-CSF control 
protems. Mean ECsoS for the 13 muteins ranged from 80 to 134 pg/ml (Table 13). 

Table 13 



Properties of GM-CSF Cysteine Muteins 



GM-CSF 


Mutation 


Mean ECjo 


ECso Range' 


Protein 


Location 


± SD (pg/ml) 


(pg/ml) 


R&Dwt" 




97±5 


90 - 100 (6) 


BBTwt'' 




105 ±8 


90-115(14) 


*-ic 


N-terminus 


ni±5 


105-115(4) 


AlC 


N-teiminus 


80 + 0 


80-80(4) 


A3C 


Proximal to A Helix 


108 ±3 


105-110(4) 


S5C 


Proximal to A Helix 


125 ± 6 


120-130(4) 


S7C 


Proximal to A Helix 


106 ±6 


100-110(4) 


N27C 


A Helix 


134 ±30 


105 - 160 (4) 


S69C 


B-C loop 


103 ± 10 


90-110(4) 


E93C 


C-D loop 


103 ± 14 


90-115(4) 


T94C 


C-D loop 


120±4 


115-125(4) 


T102C 


C-D loop 


114±3 


110-115(4) 


V125C 


Distal to D Helix 


110±0 


110-110(4) 


Q126C 


Distal to D Helix 


126 ±9 


120-140(4) 


♦128C 


C-terminus 


124 ±3 


120-125(4) 



" Observed range of ECso values; number of assays in parentheses. 
Commercial wild type GM-CSF (R&D Systems) 
Wild type GM-CSF prepared by Bolder BioTechnology 

Example 16 

PEGylation, Purification and Bioactivity of GM-CSF Cysteine Muteins 
A. Preliminary PEGylation studies. Initial PEGylation reaction conditions were determined 
using AlC, S7C and S69C as the test proteins, TCEP [Tris {2-caiboxyethyl) phosphine]-HCl as the r-educing 
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agent and 5kDa cysteine reactive PEGs from Shearwater Polymers. Three (ig aliquots of the purified 
cysteine muteins or wild type GM-CSF were incubated with increasing concentrations of TCEP at room 
temperature in 100 mM Tris, pH 8.5 in the presence of excess 5 kDa maleunide-PEG or 5 kDa vinylsulfone- 
PEG (linear forms of a polyethylene glycol polymer composed of a molecular weight average of 5 kDa with 
5 a reactive maleunide or vinylsulfone group at one of the polymer ends). The maleimide and vmyl sulfone 
groups react with Michael nucleophiles, with a high selectivity for mercaptan groups such as those 
contained on cysteine side chains. After 90 min, the reactions were immediately analyzed by non-reducing 
SDS-PAGE. The amounts of TCEP and particular PEG reagent that yielded significant amounts of 
raonoPEGylated cysteine protein, without modifying wild type GM-CSF, were chosen for use in subsequent 

10 experiments. The titration experiments indicated that at pH 8.5, a 15-fold molar excess of TCEP and 20- 
fold excess of 5 kDa maleimide-PEG yielded significant amounts of monoPEGylated AlC protein and 
monoPEGylated S7C protein without detectable di- or tri-PEGylated protein. In the case of GM-CSF S69C, 
5 kDa vinylsulfone-PEG was preferred over 5 kDa maleunide-PEG, and yielded significant amounts of 
monoPEGylated S69C protein. Recombinant wild type GM-CSF was unreactive to die PEGs, even in the 

15 presence of a 50-fold molar excess of TCEP. Control experiments indicated that the muteins needed to be 
partially reduced to be PEGylated. 

B. Preparation and Purification of PEGylated GM-CSF Cysteine Muteins: Aliquots of 200 to 
300 \ig of 10 purified GM-CSF cysteine muteins were PEGylated to provide sufficient material for 
purification and characterization. The larger PEGylation reactions also were performed for 1.5 hr at room 

20 temperature. For each of the mutants, a 15-fold excess of TC^ and 20-fold excess of 5 kDa maleimide- 
PEG was used. The only exception was S69C where 5 kDa vinylsulfone-PEG was used. These reaction 
conditions yielded monoPEGylated protein for all ten muteins. At the end of the reaction time, fiie 
PEGylation mixture was diluted 20X with ice cold 20 mM Tris, pH 8.0 before being loaded quickly onto an 
Q-Sepharose column (1 mL, HiTrap) using conditions similar to those described for the initial purification 

25 of the GM-CSF muteins (25 mL gradient, 0-0.35 M NaCl in 20 mM Tris pH 8). The presence of the PEG 
moiety decreases the protein's affinity for the resin, allowing the PEGylated protein to be separated fi-om the 
non-PEGylated protein. Non-reducing SDS-PAGE analyses of the PEGylation reactions showed that only 
detectable PEGylated species was the PEG-GM-CSF cysteine mutein monomer, which migrates with an 
apparent molecular weight ^ 26 kDa. The chromatogram from the Q-Sepharose column showed two major 

30 protein peaks. The early eluting major peak (160-200 mM NaCl) was determined to be mono-PEGylated 
GM-CSF protein by SDS-PAGE. The second major peak (200-230 mM NaCl) was determined to be 
unreacted GM-CSF protein. Fractions from the early eluting peak containing predommantly 
monoPEGylated GM-CSF cysteine mutein were pooled and used for bioactivity measurements. All the 
GM-CSF muteins were PEGylated and purified by the identical protocol. The PEGylated proteins displayed 

35 similar apparent molecular weights by SDS-PAGE, except for the PEG-E93C and PEG-T94C muteins, 
which displayed slightly smaller apparent molecular weights. Four of the cysteine muteins in the N-terroinal 
region (*AC, AlC, A3C, and S7C) also have been PEGylated on a small scale using 10- and 20 kDa 
maleimide PEGs. These reactions were performed with 3 ^g of each mutein using the conditions described 
above, and analyzed by SDS-PAGE. Each of these proteins reacted readily with the 10 kDa and 20 kDa 
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PEG reagents, yielding monoPEGylated protein. 40 kDa-PEG'A3C was also prepared following the 
protocol described above. This protocol was scaled up to provide larger quantities of the 10 kDa-, 20 kDa- 
and 40-kDa-PEG-A3 C protein, 

C. Bioactivities of PEGylated GM-CSF Cysteine Muteins: We purified sufficient quantities of 7 
5 muteins (♦-IC, AlC, A3C, S5C, S7C, S69C and E93C) modified with a 5 kDa PEG for accurate protein 
concentration and specific bioactivity measurements. Biological activities of the 7 purified 5 kDa- PEG- 
GM-CSF cysteine muteins were measured in the TF-1 cell proliferation assay. Concentrations of the 
protems were determined using a Bradford dye binding assay. All of the PEGylated GM-CSF cysteine 
muteins showed similar dose-response curves and reached the same level of maximal growth stimulation as 
10 wild type GM-CSF. Mean EC50S for the PEG-GM-CSF cysteine muteins ranged fi-om 80 - 123 pg / ml 
(Table 14). 

Table 14 

Bioactivities of PEGylated GM-CSF Cysteine Muteins 

15 ■ _______ 



GM-CSF 
Protein 


5 kDa PEG Protein 

MeanECso 

± SD (pg/ml) 


5 kDa PEG Protein 
ECjo Range* 
(pg/ml) 


♦-1C 


96±5 


90-100(4) 


AlC 


115±4 


110-120(4) 


A3C 


106 ±3 


105-110(4) 


S5C 


80+11 


70-100(6) 


S7C 


123 ± 15 


110-140(4) 


S69C 


88±6 


80-95(4) 


E93C 


86±5 


80-90(4) 



* Observed range of ECso values, number of assays in parentheses. 

Biological activities of the A3C mutein modified with 10 ld)a-, 20 IcDa- and 40 IcDa-PEG 
20 molecules were measured in the TF-1 cell proliferation assay. Concentrations of the proteins were 
deteimmed using a Bradford dye bindmg assay. Each of the PEG-A3C protems stimulated proliferation of 
TF-1 cells. Mean EC50S for the 10 kDa-, 20 kDa- and 40 kDa-PEG A3C cysteine muteins were 78 +/- 3 
pg/ml, 1 13 +/- 5 pg/ml, and 300 +/- 50 pg / ml, respectively (N= 4 assays for each protein). 

D. Apparent Molecular weights A3C modified with SlcDa-, lOkDa-, 20 IcDa- and 40 icDa-K 
25 PEGs: The s^parent molecular weights of the PEGylated GM-CSF A3C proteins were determined by size 
exclusion HPLC (SEC) using a Biorad Bio-Sil SEC-400-5 column on a Beckman System Gold HPLC. An 
isocratic gradient consisting of Phosphate Buffered Saline was used as the eluant. Retention times for each 
protein were used to calculate molecular weights based on a standard curve generated with gel filtration 
protein standards (BioRad Laboraories, Richmond, CA). The PEG ylated proteins displayed dramatically 
30 increased apparent molecular weights relative to tibe non-PEGylated GM-CSF (Table 15). Larger PEGs 
increased the apparent molecular weightof the protein more than smaller PEGs. Similar data were recorded 
for PEGylated cysteine muteins of GH, IFN-a2, and G-CSF. 
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Table 15 

Apparent Molecular Weights of PEGylated GM-CSF Cysteine Muteins 
by size Exclusion Cliromatograptiy 



Protein 


Apparent SEC Molecular 
Weight (daltons) 


GM-CSF 


20,000 


5kDa-PEGA3C 


80,000 


10kDa-PEGA3C 


200,000 


20kDa-PEGA3C 


470.000 


40kDa-PEGA3C 


680,000 



5 

Example 17 

Cloning, Expression, Purification and Bioactivity of Wild Type Murine GM-CSF and Cysteine 

Muteins of Murine GM-CSF 

We cloned and sequenced a cDNA encoding the mature mouse GM-CSF by RT-PCR of total RNA 

10 isolated from the mouse EL4.IL-2 cell line (catalogue # TIB-181) obtained from the American Type Culture 
Collection (Rockville, MD). The cells were grown in DMEM media supplemented with 10% FBS, 50 
units/ml penicillin and 50 ^ig/ral streptomycin. The cells were induced for 6 or 24 h with 1 ^g/ml PHA-L 
(Sigma-Aldrich Chemical Company, catalogue # L-4144) and 10 ng/ml PMA (Sigma-Aldrich Chemical 
Company, catalogue # P-1585) in DMEM medium, 10% FBS, 50 units/ml penicillin, 50 ^ig/ml streptomycin 

15 at 37^C prior to RNA isolation. RNA was isolated from the cells using an RNeasy Mini KNA isolation kit 
purchased from Qiagen, Inc. (Santa Clarita, CA) following the manufacturer's directions. First strand 
synthesis of single-stranded cDNA was accomplished usmg a 1st Strand cDNA Synthesis Kit for RT-PCR 
(AMV) from Boehringer Mannheim Corp and random hexamers were used as the primer. A subsequent 
PCR reaction using the products of the first strand synthesis as tenq)late was carried out with forward primer 

20 BB481 [5> GCG AC GCG TAC GCA GCA CCC ACC CGC TCA CCC ATC ACT >3; SEQ ID NO:43] 
and reverse primer BB482 . BB481 anneals to the 24 nucleotides encoding the first eight amino acids of 
mature mouse GM-CSF. BB481 also adds, immediately 5' to this sequence, nucleotides that overlap the 
sequences encoding the carboxyterminal 4 amino acids of the E, coli stn signal sequence described above in 
Example 7 and by Picken et al. (1983). These 11 nucleotides include an Mm I restriction site. BB482 [5> 

25 GCG GAA TTC TTA TTT TTG GAC TGG TTT TTT GCA TTC AAA GGG >3; SEQ ID NO:44] anneals 
to the nucleotides encoding the carboxyterminal ten amino acids of mouse GM-CSF and adds a TAA 
translational stop codon and an Eco RI restriction site unmediately following the coding sequence. Both the 
6h and 24h RNA samples yielded a GM-CSF RT-PCR product The resulting ~ 400 bp PCR product from 
the 6h RNA sample was digested with Mu I and Eco RI, gel purified, and cloned into pBBT227 

30 [pUC18::sti[I-G-CSF(C17S)] which is described in Example 8 above. pBBT227 DNA was digested with 
Mu I and Eco RI, alkaline phosphatase treated, and run out on a 1% agarose gel. The ~ 2.4 kb vector 
fragment was purified and used in ligation. The resulting recombinants carry a complete stn leader fused to 
murine GM-CSF and this "stll-muGM-CSF" construct can be excised as an Nde I - Eco RI fragment of ~ 
450 bp. One clone with the correct sequence (Gough et al, 1984) was designated pUC18::sffl[-muGM-CSF 

35 or pBBT435. For expression studies the Nde I - Eco RI fragment t)f pBBT435 was subcloned into the 
e)q>ression vector pBBT257, which is described in Example 14 above. The resulting plasmid, 
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pBBT257::stII-iiiuGM-CSF, or pBBT456 was introduced into E coli W3110 for expression. Wild type 
mouse GM-CSF was expressed and purified using the protocols for expression and purification of human 
GM-CSF described in Example 14 above. 

Mutant mouse GM-CSF genes can be constructed using site-directed FCR-based mutagenesis as 
5 described in general by Innis et al., 1990) and Horton et all, (1993) and in the other Examples above. One 
mutein, T3C, was constructed in the amino-terminal region proximal to Helix A. The mutagenic PGR 
reaction was carried out using plasmid pBBT435 (described in Example 17) as template and forward 
primer BB504 [5> GCG AC GCG TAC GCA GCA CCC TGC CGC TCA CCC ATC ACT >3; SBQ ID 
NO:45] and reverse primer BB482 [5> GCG GAA TTC TTA TTT TTG GAC TGG TTT TTT GCA TTC 

10 AAA GGG >3; SEQ ID NO:46]. BB504 Changes the ACC codon for threonine at position 3 of mature 
mouse GM-CSF to a TGC codon for cysteine. The resulting - 400 bp PGR product was digested with Mlu I 
and Eco RI, gel purified, and cloned into pBBT435 that was digested with Mu I and Eco RI, alkaline 
phosphatase treated, and gel-purified. One clone with the correct sequence was designated pUC18::stII- 
muGM-CSF(T3C). For expression studies the Nde I - Eco RI firagment of this plasmid was subcloned into 

15 the expression vector pBBT257, which is described in Example 14 above. The resulting plasmid, 
pBBT257::s1II-muGM-CSF(T3C), or pBBT469 was mtroduced into E coli JM109 for c3q)ression. The T3C 
mutein of mouse GM-CSF was expressed and purified using the protocols for expression and purification of 
human GM-CSF described in Example 14 above. 

Using procedures similar to those described here, and m Examples 9 and 15 above, one can 

20 construct other cysteine muteins of mouse GM-CSF. The cysteine muteins can be substitution mutations 
that substitute cysteine for a natural amino residue in the GM-CSF coding sequence, insertion mutations that 
insert a cysteine residue between two naturally occurring amino acids in the mouse GM-CSF coding 
sequence, or addition mutations that add a cysteine residue precedmg the first amino acid of the mouse GM- 
CSF coding sequence or add a cysteine residue following the terminal amino acid residue of the mouse GM- 

25 CSF codmg sequence. The cysteine residues can be substituted for any amino acid, or mserted between any 
two amino acids, anywhere in the mouse GM-CSF coding sequence. Preferred sites for substituting or 
inserting cysteine residues are in the region preceding Helix A, the A-B loop, the B-C loop, the C-D loop, 
and the region distal to Helix D. Other preferred sites are the first or last three amino acids of the A, B, C, 
and D Helices. One also can construct muteins containing a free cysteine by substituting another amino acid 

30 for one of the naturally occurring cysteine residues in GM-CSF that normally forms a disulfide bond. The 
naturally occurring cysteine residue that normally forms a disulfide bond with the substituted cysteine 
residue is now free. The cysteine residue can be replaced witii any of the other 19 amino acids, but 
preferably with a serine or alanine residue. A free cysteine residue also can be mtroduced into GM-CSF by 
chemical modification of a naturaUy occurring amino acid using procedures such as those described by 

35 Sytkowskietal.(1998). 

Multiple mutants containing two or more added free cysteine residues can also be constructed 
either by sequential rounds of mutagenesis using the procedures described in Examples 9 and 15 above or 
alternatively by in vitro recombination of individual mutants to construct recombinant expression plasmids 
encoding muteins containing two or more free cysteines. The preferred multq)le mutants would be those 
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that combined two or more cysteine muteins that each retain high, or complete, specific activity when 
PEGylated. 

Using procedures similar to those described in Examples 12-14, 15 and 16, one can express 
purify, and PEGylate mouse GM-CSF muteins and measure biological activities of these proteins in an in 
vitro bioassay and in vivo efficacy models* The proteins can be expressed cytoplasmically in E. coli or as 
proteins secreted to the periplasmic space. The muteins also can be expressed in eukaryotic cells such as 
insect or mammalian cells, using procedures similar to those described in PCT/USOO/00931, or related 
procedures well known to those skilled in the art. If secretion from eukaryotic cells is desired, the natural 
GM-CSF signal sequence, or another signal sequence, can be used to secrete the proteins from eukaryotic 
cells. 

The purified mouse GM-CSF wild type protem, cysteine muteins, and PEGylated forms of the 
cysteine muteins can be assayed for biological activity with a cell proliferation assay using the NFS60 cell 
line as described in Examples 8 and 9 above. 

Murine wild type GM-CSF and the murine T3C GM-CSF cysteine mutein were isolated Scorn E. 
coli following the procedure described for human WT-GM-CSF (Examples 14-16) with the exception Aat 
30% ammonium sulfate was used to bind the murine proteins to a Fhenyl-Sepharose column rather than 15% 
as described for human GM-CSF. The murine T3C cysteine mutant readily PEGylated with lOkDa, 20 kDa 
and 40 kDa PEG maleimide reagents using the protocols described above for human GM-CSF A3C cysteine 
mutein. Bioactivities of these PEGylated proteins can be measured in the NFS60 cell proliferation assay as 
described in Examples 8 and 9. 

Example 18 

E. coli Expression and Purification of Wild Type human Erythropoietin 
A. Expressing Erythropoietin by secretion in E. coli. The DNA encoding wild type human 
25 Erythropoietin (Epo) was amplified by PCR from the plasmid pBBT358 <see below), which contains a gene 
for Epo in the vector pBlueBac 4.5 (Invitrogen), which has been used for expression of Epo in insect cells. 
The gene for Epo in pBBT358 is similar to the natural cDNA, except for three silent mutations at codons for 
amino acids 84 and 85 (of mature Epo) that create an Xhol restriction site to faciUtate the mutagenesis 
process. 

30 The three mutations that created the Xho I site were incorporated using the technique of 

'^mutagenesis by overlap extension" as described in Horton et al. (1993) and PCT/US00y00931. The initial, 
or •'primary" PCR reactions for the Xho I construction were performed in a 50 fd reaction volume in IX 
Promega PCR buffer containing L5 mM MgCl2 , each primer at 0.4 ^iM, each of dATP, dGTT, dTTP and 
dCTP at 200 ^M, 1 ng of template plasmid pBBT132 <the wild type Epo-Flag gene, cloned as a BaniR 1 - 

35 EcoK I fragment in pUC19, (described in PCT/USOO/00931), 2 units of Taq Platinum <BRL), and 0.25 units 
of Pfii Polymerase (Stratagene). The reactions w^ performed in a Pedcin-Elmer GeneAmp® PCR System 
2400 thennal cycler. The reaction program entailed: 95**C for 5 mmutes, 25 cycles of [94** C for 30 seconds, 
56** C for 30 seconds, 72** C for 45 seconds], a 7 mm hold at 72**C and a hold at 4**C The primer pairs used 
were 1BB361 x BB125] and [BB362 x BB126]. BB361 (5>GTTGGTCAAC TCGAGCCAGC 
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CGTGGGAG>3; SEQ ID NO:79] anneals to DNA sequences encoding amino acid residues 81-89 of mature 
Epo. BB125 (5> CTATGC GGCATCAGAGCAGATA >3; SEQ ID N0:17) anneals to the pUC19 vector 
sequence ^20 bp do^oistream of the cloned Epo sequence. The PGR products were run out on a 1.5% 
agarose gel, excised from the gel, and isolated using a QIAquick Gel Extraction Kit (Qiagen) according to 
5 the vendor protocol. These two mutagenized fragments were then "spliced" together in the subsequent, or 
"secondary" PGR reaction. In this reaction 0.3^1 of each of the gel-purified PGR products of the primary 
reactions were used as template and BB125 and BB126 were used as primers. The reaction volume was 50 
|il and 2.5 units of Taq Polymerase and 0.5 units of Pfu Polymerase were employed. Otherwise, the reaction 
conditions were identical to those used in the primary reactions. An aliquot of the secondary PGR was 

10 analyzed by agarose gel electrophoresis and the expected band of -190 bp was observed. The bulk of the 
secondary PGR reaction was "cleaned up" using the QIAquick PGR Purification (Qiagen), digested with Kpn 
I and Stu I (New England BioLabs) according to the vendor protocols. Following an additional clean up 
using the QIAquick PGR Purification Kit, the digestion products were ligated with pBBT138 (the wild type 
Epo-Flag gene cloned as a Bamn I - Ecd^ I fragment in pBlueBac 4.5, (PGTAJSOO/00931)), that had been 

15 cut with with Kpn I and Stu I, treated with calf intestinal alkaline phosphatase (New England BioLabs) and 
gel purified. The ligation reaction was used to transform £. coli and plasmids from resulting transformants 
were sequenced to identify a clone containmg the Xho I site and having the correct sequence throughout the ' 
433 bp Kpn I -Stu I segment. This clone is designated pBBT358. 

For expression of Epo fused to the STII signal peptide, (a peptide sequence which directs secretion 

20 of the mature protein into the E. coli periplasm, the oligonucleotides used in the PGR reaction were BBS83 
(5>GGAAGGGGTA GGCAGGCGGA CCAGGGGTGATC3>; SEQ ID NO:46), which anneals to the N- 
terminal coding region of the gene, and either BB585 (5>GGGGAATTCT TAAGGGTGAG CTGTGGGGGA 
GGG>3; SEQ ID NO:47) or BB586 (5>CGGGAATTGT TAGTCAGGTG TGGGGGAGGC >3; SEQ ID 
NO:48), which anneal to the G-terminal coding region of the gene. BB585 includes the codon for Argl66, 

25 the G-tenninal amino acid predicted by the cDNA sequence, whereas BB586 deletes the Argl66 codon and 
codes for Aspl65 as the G-terminal amino acid. The resultmg - 600 bp PGR products were digested with 
Mlul and Eco RI and cloned into a similarly digested pBBT227 (Example 9) vector to create fusions 
between the STII leader sequence and the amino terminal coding sequence of wild type Epo. The gene 
formed by PGR using BB583 and BB585 is temed STII-Epo-fiiU length (STII-Epo-FL), and the gene formed 

30 by PGR using BB583 and BB586 is tenned STII-Epo-des Arg (STH-Epo-dR). STO-Epo-FL and STH-Epo- 
dR clones with the correct sequence were then subcloned as Nde hEco RI fragments mto pBBT257 
(described in Example 14) to create pBBT477 and pBBT478, respectively. 

pBBT477 and pBBT478 were transformed into JM109 to create strains BOB578 and BOB579. 
These strains, along with BOB490 (pBBT257/JM109) were grown overnight in Luria Broth (LB media) 

35 containing 10 ^g/ml tetracycline at 37°C in roll tubes. Saturated overnight cultures were diluted to «- 0.025 
O.D. at A^oo in LB media containing 10 |^g/ml tetracycline and incubated at 37^C in shake flasks. Typically a 
25 ml culture was grown in a 250 ml shake flask. When culture O.D.s reached --0.3 - 0.5, IPTG was added 
to a final concentration of 0.5 mM to induce expression either Epo wild type or Epo des Argl66. For initial 
-expaiments, cultures were san^led at 4 and ^19 h post-induction. Samples were analyzed by SDS- 
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polyacrylamide gel electrophoresis (SDS-FAGE) on precast 14% Tris-glycine polyacrylamide gels and 
stained with Coomassie Blue. Induced cultures of both BOB578 and BOB579 showed a band at 
approximately 20 kDA, which is consistent with the molecular weight of wild type Epo. This band was not 
detected in the induced culture of BOB490, the vector-only control. 
5 B. Expressing Met-£rythropoietin in the cytoplasm ofE. colL As described in Example 18. A. 

, the DNA encoding wild type human Erythropoietin (Epo) was amplified by PGR from the plasmid 
pBBT358, which contains a gene for wild type Epo. For expression of met-Epo in the cytoplasm of E. coli, 
the oligonucleotides used in the PGR reaction were BB584 (5> TTC OCT AGC ATG GAT GAG CTG GAG 
GAG GAA ATT TAA ATG GCC CCA CCA CGG CTG ATG 3>; SEQ ID NO:49), which anneals to the N- 

10 terminal coding region of the gene, and either BB585 (5>GGGGAATTGT TAAGGGTGAG CTGTGGGGGA 
GGG>3; SEQ ED NO:47) or BB586 (5>GGGGAATTCT TAGTGAGCTG TGCGGGAGGC >3; SEQ ID 
NO:48), which are described above. The resulting 600 bp PGR products were digested with Mlul and Eco 
RI and cloned into a similarly digested pBBT227 (Example 9) vector to create genes encoding methionyl- 
Epo. The gene formed by PGR using BB583 and BB585 is termed met-Epo-fuU length (met-Epo-FL), and 

15 the gene formed by PGR using BB583 and BB586 is termed met-Epo-des Arg (raet-Epo-dR). Met-Epo-FL 
and met-Epo-dR clones with the correct sequence were then subcloned as Nde l-Eco RI fragments into 
pBBT257 (described in Example 14) to create pBBT479 and pBBT480, respectively. 

pBBT479 and pBBT480 were transformed mto JM109 to create strains BOBS80 and B0BS81. 
Expression experiments with these strains, along with BOB490 (pBBT257/JM109) were the same as those 

20 described above for the STII-Epo constructs. Induced cultures of both BOB580 and BOB581 showed a band 
at approximately 20 kDA, which is consistent with the molecular weight of wild type Epo. This band was not 
detected in the induced culture of BOB490, the vector-only control. 

Example 19 

25 Construction, £ coli Expression, Purification and Bioactivity of Erythropoietin Cysteine Muteins 

A. Construction of Epo Cysteine Muteins. Methods for constructing Epo cysteine muteins using 
site-directed PGR-based mutagenesis procedures and preferred sites for locations of cysteine muteins in 
EPO are described in PGT/USOO/00931, PGT/US98/14497, and Innis et al. (1990) and White (1993) and 
the various Examples provided herein. In addition, L80 is another preferred site for a cysteine substitution 

30 mutein. 

Recombinant erj^opoietin and cysteine muteins of erythropoietin can be expressed in E. coli 
using the procedures described in Example 18 for wild type EPO. The cells are lysed using B-per (Pierce) 
following the manufacture's instructions and the insoluble portion is isolated by centrifugation. The pellet is 
solubilized using 20 mM cysteine, 6 M guanidine, 20mM Tris. The mixture is stirred for 1-2 hoxirs at room 
35 temperature before being diluted 1:20 (v/v) with 20 m Tris, pH 8, 40 jxm copper sulfate, 2% lauroyl 
sarcosine. . The renaturation is allowed to sit at 4^G for 24-48 houre. The refolded EPO and EPO cysteine 
muteins are purified using an S -Sepharose column equilibrated in 20 mM Mes, pH 5, 0.01% Tween and 
20% glycerol (Buffer A) . EPO can be eluted from the S-Sepharose column usmg a linear gradient of 0 -IM 
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NaCl in Buffer A. Secondary columns for further purification of the recombinant EPO, if necessary, include 
SEC, Blue-sepharose, hydroxyapitite, or HIC resins (phenyl, butyl). 

Example 20 

S Construction of Disulfide-Iinked Trimers and 

Disulfide-linked Higher Order Multimers of Cysteine Mutelns 

GH variants having more than one "free" cysteine could be constructed and used to create higher 
order disulfide-linked multimers of hGH as described in PCT/USOO/00931. Such a variant could be 
expressed in E. coli , refolded and purified as disclosed in Examples 1 and 2 and PCT/US/00/00931. 

10 Subsequent processing steps could then be employed to induce disulfide bond formation as described in 
Example 2 and PCT/USO0/0093i. Under such conditions some hGH variants having one free cysteine, 
such as T3C, are converted virtually quantitatively to disulfide-linked dimers. Under the same or similar 
conditions mtermolecular disulfide formation by an hGH variant having two free cysteines, e. g. a double 
mutant that combined T3C and another cysteine mutein, would result in a polymerization of hGH molecules 

15 and the chain length of such polymers would in principle be unlimited. The chain length could be limited 
and to some extent controlled by addition to the polymerization reaction of hGH molecules having only one 
free cysteine such as the T3C variant and / or other cysteine muteins. Disulfide bond formation between the 
growing polymer and a molecule having only one free cysteine will "cap" or prevent further extension of 
one of the two polymerization sites in the nascent polymer. A subsequent reaction of a second hGH 

20 molecule that has only one firee cysteine with the other polymerization site of that nascent polymer 
terminates polymerization and fixes the length of that polymeric molecule. The average polymer length 
could be controlled by the stoichiometry of the reactants, i.e. the ratio of hGH molecules with two free 
cysteines to hGH molecules with one free cysteine. Average shorter polymers would be favored by lower 
ratios and average longer polymers would be &vored by higher ratios. M(»:e con^lex "branched" polymers 

25 could be constructed from reactions involving hGH variants with 3 or more free cysteines with hGH variants 
havmg only one free cysteine. 

Discrete size classes of certain polymers could subsequently be purified by chromatographic 
methods such as size exclusion chromatography, ion exchange chromatography, hydrophobic interaction 
chromatography, and the like. Similar procedures to those described for GH could be used to create 

30 disulfide-linked dimers and higher order multimers of G-CSF, alpha mterferon, GM-CSF and other protems. 

Example 21 

Cloning, Expression and Purification of Wild Type liuman Endostatin 
A.Cloning DNA sequences encoding Endostatin. A cDNA encoding Endostatin was anq>lified 
35 by PGR from a human fetal liver cDNA library (Clontech). PGR reactions were carried out with forward 
primer BB383 (5>GCTAACGCGTACGCACACAGCCACCGCGACTTCCAGCCG>3; SEQ ID NO:50) 
and reverse primer BB384 (5>CGGAATTCCTCGAGCTACTTGGAGGCAGTCATGAAGCT>3; SEQ ffi 
N0:51). Primer BB383 anneals to die 5' end of the coding sequence of human ^dostatin and the reverse 
primer, BB92, anneals to Ihe 3* end of the Endostatin wding sequence. Tte resulting ~ 600 bp PGR product 
40 was digested with Mul and Eco RI and cloned into a similarly digested pBBT227 (Example 9) vector to 
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create a fusion between the STII leader sequence and the amino terminal coding sequence of human 
Endostatin. After confirming its sequence, the gene was modified for intracellular e3q)ression by PGR 
amplification with forward primer BB434 (5'>GTGCACCATA TGAAGAAGAA CATCGCATTC 
CTGCTGGCTA GCATGCATGA CCTGCAGGAG GAAATTTAAA TGCACAGCCA CCGCGACTT03'; 
5 SEQ ID NO:52) and BB384 (SBQ ID N0:51). BB434 fuses a methionine (met) codon to the amino 
terminus of Endostatin, The resulting 630 bp fragment was digested with Ndel and SacH and cloned into a 
similarly digested STII-Endostatin-pUC18 plasmid described above. A met-Endostatin clone with the correct 
sequence (pBBT370) was then subcloned as a Nde l-Eco RI firagment into pBBT2S7 (described in Example 
14) to create pBBT371. 

10 B. Expression of Wild Type met-Endostatin in E- coll. pBBT371, which encodes Met- 

Endostatin wild type, and pBBT257, the parent vector, were transformed into coli JM109 to create 
strains BOB460 and BOB490, and into W3110 to create strains BOB461 and BOB340. These strains were 
grown overnight in Luria Broth (LB media) containing 10 ^g/ml tetracycline at 37°C in roll tubes. Saturated 
ovemight cultures were diluted to ~ 0.025 O.D. at Aeoo in LB 10 |ig/ml tetracycline and incubated at 37^C in 

15 shake flasks. Typically a 25 ml culture was grown in a 250 ml shake flask. When culture O.D.S reached 
-0.3 - 0.5, BPTG was added to a final concentration of 0.5 mM to induce expression of human met- 
Endostatin, For initial experiments, cultures were sampled at 0, 4 and -19 h post-induction. Samples were 
analyzed by SDS-polyacrylamide gel electrophoresis (SDS-PAGE) on precast 14% Tris-glycine 
polyacrylamide gels and stained with Coomassie Blue. Induced cultures of both BOB460 and BOB461 

20 showed a band at approximately 20 kDA, which is consistent with the mature human Endostatin. This band 
was not detected m the uninduced cultures of BOB460 and BOB461 or in induced or unmduced cultures of 
BOB490 and BOB340, the vector-only controls. The -20 kDa band co-migrated with commercially 
prepared human Endostatin purchased £:om Calbiochem. 

25 Example 22 

Construction, Expression, Purification and Bioactivity of human Endostatin Cysteine Muteins 
A. Construction of Endostatin Cysteine Muteins. Eleven mutant human Endostatin genes were 
constructed using site-directed PCR-based mutagenesis procedures similar to those described in 
PCT/USOO/00931 and Innisetal. (1990) and White (1993). Four muteins [*-lC,H2C,R5Q and F7C] were 

30 constructed in the aimno-terniinal region (the amino acid residues are numbered by subtracting 130 firom the 
numbered residues in Hohenester et al. (1998)); three mutems were at residues encoded by sequences 
around the center of the gene [G90C, G98C, and H112C]; and three muteins were in its caiboxy-terminal 
region [L154C, R157C and S162C]. One additional mutem [R28C] was constructed at a residue within the 
active site of Endostatm. This could serve as a control protein in the bioassay. 

35 The source of template firagments used for the mutagenic PGR reactions was plasmid pBBT370. 

PGR products were digested with appropriate restriction endonucleases, extracted using the Qiagen PGR 
cleanup kit and ligated with pBBT370 vector DNA that had been cut with those same restriction enzymes, 
alkaline phosphatase ti:eated, and^tracted using the Qiagen PGR cleanup kit Transformants fix>m these 
ligations were grown up and plasmid DNAs isolated and sequenced. The sequence of the entire cloned 
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mutagenized PGR fragment was detennmed to verify both the presence of the mutation of interest and the 
absence of any additional mutations fliat potentially could be introduced by the PGR reaction or by the 
synthetic oligonucleotide primers. 

The cysteme substitution mutation MC was constructed using three PGR amplifications as 
5 follows. The mutagenic forward oligonucleotide BB531 (5>GAGGAAATTT AAATGTGrCCA 
GAGGCATCGG GACTTCC>3; SEQ ID NO:53) was designed to insert a TGC cysteine codon between 
the N-terminal ATG methionme codon and the first GAG histidine codon. This oligo was used in PCR#1 
with the reverse, non-mutagenic, primer BB126 (S>TGTGGAATTG TGAGGGGATA AG>3; SEQ ID 
N0:S4) which anneals to pUGlS vector sequences within 60bp downstream of the Endostatin coding 

10 sequence. The template for this PGR was a purified 1264bp Nhe VApdL I fragment derived fix)m pBBT370. 
This fragment contains the entire Endostatin coding sequence and 670bp of pUG18 sequence downstream of 
the Endostatin gene, includmg the sequence to which BB126 anneals. PGR #1 was a 25 |a1 reaction 
performed in IX Promega PGR buffer containing 1.5 mM MgGl2, each primer at 0.4 fiM, each of dATP, 
dGTP, dTTP and dGTP at 200 pM, 0,5 ng of template fragment, 1 unit of Taq Polymerase (Promega), and 

15 0.1 unit of Pfii Polymerase (Stratagene). The reaction was performed in a Perkin-Ehner GeneAmp® PGR 
System 2400 thermal cycler. The reaction program entailed: 95°G for 5 minutes, 22 cycles of [94** G for 30 
seconds, 55°G for 30 seconds, 72**G for 45 seconds], a 7 min hold at 72**G and a hold at 4°C. 

PGR #2 was performed using the mutagenic reverse oligonucleotide BB532 (5>GGAAGTCGCG 
ATGGGTGTGG GAGATTTAAA TTTGGTG>3; SEQ ID NO;55), which is the inverse complement of 

20 BB53U and the non-mutagenic primer BB125 (5>GTATGGGGGA TGAGAGGAGAT>3; SEQ ID 
NO: 17), which anneals to pUGlS sequences 40bp upstream of the Nde I site. The template for PGR#2 was 
a purified 990bp Ssp I -£coR I Augment from pBBT370 containing die entire Endostatin coding sequence 
and 367bp of pUGlB sequence upstream of the Nde I site at the 5' end of the Endostatin gene fragment, 
including the sequence to which BB125 anneals. The components and program for PGR#2 are the same as 

25 PGIWfl, Ten ^1 aliquots of PGR#land #2 were analyzed by agarose gel electrophoresis and each found to 
have produced a single fragment of the expected size. 

PGR #3 was a 50^1 reaction performed using non-mutagenic primers BBI25 and BB 126. The 
template for this PGR was 1 |al of PGR #1 and 0.3 ^il of PGR #2. The components of PGR #3 were the same 
as reactions 1 and 2. The reaction program entailed: 95°G for 5 minutes, 23 cycles of [94*^ C for 30 

30 seconds, 56°C for 30 seconds, 72''C for 1 min], a 7 min hold at 72°G and a hold at 4*'G. A 10 ^1 aliquot of 
PGR #3 was analyzed by agarose gel electrophoresis and found to have generated a 740 bp fragment, as 
expected. The remainder of the reaction was "cleaned up" using the QIAquick PGR Purification Kit 
(Qiagen) according to the vendor protocol and digested with Nhe I and BsrG I (New England BioLabs) 
according to the vendor protocols. Following an additional clean up step using tiie QIAquick PGR 

35 Purification Kit, the digestion products were ligated with pBBT370 that had been cut wifli Nhe I and BsrQ I, 
treated with calf intestinal alkaline phosphatase (New England BioLabs) and "cleaned up" using the 
QIAquick PGR Purification Kit The ligation reaction was used to transform E. coli JM109 and plasmids 
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from resulting transfoimants were sequenced. A clone having the *-lC mutation and the correct sequence 
throughout the 205 bp Me I - BsrG I segment was identified. 

The substitution mutations H2C, RSC, F7C, and R28C (i.e. changing histidine at position 2 to 
cysteine, etc.) were constructed and sequence verified using the protocols detailed above for *-lC, except 
that different mutagenic oligonucleotides were used (Table 16). The forward mutagenic oligonucleotides 
were always used in conjunction with the reverse, non-mutagenic, primer 6B126 and the piuified 1264bp 
Nhe l-ApaL I fragment as template, and the reverse mutagenic oligonucleotides were always used in 
conjunction with forward, non-mutagenic, primer BB12S and the purified 990bp Ssp I -EcaiR. I fi:agment as 
template. 

Table 16 

Oligonucleotides used to construct ^dostatin cysteine muteins 



Mutation Oligonucleotide Direction 



H2C 



H2C 



20 RSC 



RSC 



F7C 



F7C 



R28C 



35 R28C 



G90C 



G90C 



G98C 



G98C 



SO H112C 



BBS33 



BB534 



BB535 



BBS36 



BB537 



BB538 



BB539 



BB540 



BB543 



BB544 



BBS45 



BB546 



BB547 



Forward 



Reverse 



Forward 



Reverse 



Forward 



Reverse 



Forward 



Reverse 



Forward 



Reverse 



Forward 



Reverse 



Forward 



Sequence (5* > 3'): Cvs codon shown in bold 

GAGGAAATTTAAATTGCAGCCATCGCGACTTCCAG 
SEQIDNO:56 

CTGGAAGTCGCGATGGCTGCACATTTAAATTTCCTC 
SEQIDNO:57 

ATGCACAGCCACTGCGACTTCXDAGCCG 
SEQIDNO:58 

CGGCTGGAAGTCGCAGTGGCTGTGCAT 
SEQIDNO:59 

GCCACCGCGACTGTCAACCGGTGCTCCAC 
SEQIDNO:60 

GTGGAGCACCGGTTGACAGTCGCGGTGGC 
SEQIDN0:61 

CATGCGGGGCATCTGCGGCGCCGACTTCCAG 
SEQID1W:62 

CTGGAAGTCGGCGCCGCAGATGCCCCGCATG 
SEQIDNO:63 

GGCTCTGTTCTCGTGCTCTGAGGGTCC 
SEQroNO:64 

GGACCCTCAGAGCACGAGAACAGAGCC 
SBQIDNO:65 

CCGCTGAAGOCCTGCGCACGCATCTTC 
SEQIDNO:66 

GAAGATGCGTGCGCAGGGCTTCAGCGG 
SEQIDNO:67 

GACGTCCTGAGGTGCGCGACCTGGCCCCAG 
SEQIDNO:68 
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L1S4C BB549 Reverse 

5 

R157C BB550 Forward 

10 R157C BB551 Reverse 

S162C BB552 Forward 

15 

S162C BB553 Reverse 
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GGCCAGGCCTCCAGCCTCTGCGGGGGCAGGCTC 
SEQIDNO:69 

GAGCCTGCCCCCGCAGAGGCTGGAGGCCTGGCC 
SEQIDNO:70 

CTGCTGGGGGGCTGCCTCCTGGGCCAGAGTGCCGCG 
SEQIDN0:71 

CGCGGCACTCTGGCCCAGGAGGCAGCCCCCCAGCAG 
SEQIDNO:72 

CTCCTGGGGCAGTGCGCAGCGAGCTGCCATC 
SEQIDNO:73 

GATGGCAGCTCGCTGCGCACTGCCCCAGGAG 
SEQIDNO:74 



Muteins G90C and G98C were constructed by methods similar to those described for *-lC, except 

20 the mutagenic oligonucleotides were different (Table 16) and the template for PGR #3 was 0.5 jid of PGR #1 
and 0.5 yd of PGR #2, In addition, after "clean up " PGR #3 was digested with BsrG I and Bsu36 I (New 
England BioLabs) and following an additional clean up step, the digestion products were ligated with 
pBBT370 that had been cut with BsrG I and Bsu36 1, treated with calf intestinal alkaline phosphatase (New 
England BioLabs) and "cleaned up" using the QIAquick PGR Purification Kit. 

25 Muteins L154C, R157C, and S162C were constructed by methods similar to those described for *- 

IC, except the mutagenic oligonucleotides were different (Table 16) and the template for PGR #3 was 0.3 pi 
ofPCR#l and 1 ^lofPCR#2. In addition, after "clean up," PGR #3 was digested with B«i36 land £coRI 
and following an additional clean up step, the digestion products were ligated with pBBT370 that had been 
cut with Bsu36 1 and Eco RI, treated with calf intestinal alkaline phosphatase (New England BioLabs) and 

30 "cleaned up'* using the QIAquick PGR Purification Kit. 

Mutein H112C was constructed by methods different in several respects &om those described for 
*-lC. First, the sequence of the mutagenic forward oligonucleotide used in PGR #1 was different (Table 16) 
and the volume of the reaction was 50 |il instead of 25 ^il, PGR #2 and PGR #3 were not performed, 
because they v/&ce not necessary. Instead, after a 10 ^1 aliquot was analyzed by gel electrophoresis, this 

35 reaction was treated much the same as PGR #3 is normally treated. That is, the remainder of the reaction 
was "cleaned up" usmg the QIAquick PGR Purification and digested with Bsu36 I and EcoR I (New 
England BioLabs) according to the vendor protocols. Following an additional clean up step using the 
QIAquick PGR Purification Kit, the digestion products were ligated with pBBT370 that had been cut with 
^^36 1 and EcoR I, treated with calf intestinal alkalme phosphatase (New England BioLabs) and "cleaned 

40 up" using tiie QIAquick PGR Purification Kit. The ligation reaction was used to transform E. coli JM109 
and plasmids fix)m resulting transformants were sequenced. 

B. Expression of Cysteine muteins of met-Endostatin in E. coli: Each met-Endostatin 
Cysteine mutein clone witii &e correct sequence was subcloned as a Nde I-Eco RI 
fragm^t into pBBT257 (described in Example 14) to generate a set of expression 
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plasmids which were transformed in JM109 to create the strains used in expression 
studies. 

These strains were grown overnight in Luria Broth (LB media) containing 10 ng/ml tetracycline at 
37**C in roU tubes. Saturated ovmiight cultures were diluted to ~ 0.025 O.D. at in LB 10 fifi/ml 
5 tetracycline and incubated at 37**C in shake flasks. Typically a 25 ml culture was grown in a 250 ml shake 
flask. When a culture O.D. reached ~0.3 - 0.5, IPTG was added to a final concentration of 0.5 mM to 
induce expression of the .Endostatin Cysteine mutein specific for that stain. Preinduction, 4 hour post- 
induction, and 16 hr post-induction samples were collected. Samples were analyzed by SDS- 
polyacrylamide gel electrophoresis (SDS-PAGE) on precast 14% Tris-glycine polyacrylamide gels and 
10 stained with Coomassie Blue. Induced cultures of each of the Endostatin cysteine mutein strains showed a 
band at approximately 20 kDA, which is consistent with the mature human Endostatin. This band was not 
detected in the uninduced cultures or in induced or uninduced cultures of BOB490, the vector-only control. 
The -20 kDa band co-migrated with commercially prepared human Endostatin purchased firom 
Calbiochem. 

15 C. Expression and purification of endostatin and endostatin cysteine muteins: E.coli 

containing e}q)ressed wild type endostatm or endostatm cysteine mutein R5C were pelleted by centrifugation 
and fiiozen at -80** C. Cell pellets were thawed and treated with 5 mL of B-PER ™ bacterial protein 
extraction reagent according to the manufacturer's (Pierce) protocols. The insoluble material, which 
contained the bulk of the endostatin protein, was recovered by centrifiigation and resuspended in B-PER, 

20 This mixture was treated with lysozyme (200 ng/mL) for 10 min to further disrupt the cell walls, and MgCU 
(10 mM final concentration) and protease-free DNAse (2 ^ig/ml) were added. Insoluble endostatin was 
collected by centrifiigation and washed, by resuspension in water and recentrifixgation, to remove most of 
the solubilized cell debris. For refolding, the resulting pellet containing insoluble endostatin was dissolved 
in 20 ml of 8 M urea, 10 mM cysteine in 20 mM Tris Base. This mixture was stirred for 120 min at room 

25 temperature. Cystine was added to a final concentration of 10 mM before the solublization was diluted into 
200 ml of ice cold 3 M urea, 40 ^M copper sulfate, 20 mM Tris, pH 7.5. This refold mixture was slowly 
stirred at 4°C for 3days. The pH of the refold mixture was then adjusted to 5.0 with dilute HCl and the 
mixture was centrifiiged before being loaded onto a 5 ml S-Sepharose column (Pharmacia HiTrap) 
equilibrated in 40 mM sodium phosphate pH 5.0 (Bi^er A). The bound protems were eluted with a linear 

30 salt gradient firom 0-100% Buffer B (500 mM NaCl, 20 mM sodium phosphate, pH 5.0). The S-Sepharose 
firactions containing predominantly endostatm were pooled with their pH being adjusted to 7.4 before being 
loaded onto Heparin-Sepharose (Hi trap) column, previously equilibrated in 20 mM Tris, pH 7.4. The 
column was eluted with a 0-1 M NaCl salt gradient Heparin column firactions with pure endostatin were 
pooled and firozen, Endostatin cysteine mutants G90C, G98C, H112C, and R157C have also been partially 

35 purified using the above protocol^ with the heparin column step omitted. 

R5C ^dostatin cysteine mutem was PBGylated using a 15X excess of 5 kDa PEG maleunide and 
10-15-fold excess of TCEP. The reaction yielded monoPEGylated R5C protein. 
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D. Endostatin Bioassay: Refolded wild type recombinant endostatin and the refolded R5C 
endostatin cysteine mutein were shown to be biologically active using the MMP-2 inhibition assay described 
byKim,etal.(2000). 

Bioactivity of the proteins also can be measured in an endothelial cell proliferation inhibition 
5 assay. In vitro inhibition of endothelial cell proliferation can be performed as follows. Five thousand 
HMVEC-L cells (Clonetics) can be plated onto gelatinized 96-well culture plates and incubated (37**C, 5% 
CO2) for 24 hr in 100 pi HMVEC-L medium containing bFGF. The medium is then replaced with 20 jiil of 
medium containing serial dilutions of endostatin, endostatin cysteine muteins or PEGylated endostatin 
cysteine muteins, and incubated for 20 min. Eighty (xl of fresh HMVEC-L medium containing bFGF is then 
10 added to the well. After 72 hr, cell numbers can be determined. The various Endostatin proteins iwill nhibit 
proliferation of the endothelial cells, as demonstrated by dose-dependent decreases in endothelial cell 
numbers at the end of the assay. 

Example 23 

15 Refolding of Recombinant Angiostatin Cysteine Muteins 

Angiostatin is fully active when non-glycosylated and thus, does not require a eukaryotic 
expression system for production. The coding sequence for human angiostatm, consisting of the first four 
kringle subunits of human plasminogen, can be PCR-amplified from a human plasminogen cDNA template 
(available fix)m the American Type Culture Collection, Rockville, MD). Wild type angiostatin and 

20 angiostatin cysteine mutems can be secreted from E. coli by fusing bacterial signal sequences such as those 
from the STn or ompA proteins onto the N-terminus of mature angiostatin for the purpose of transporting 
the protein into the periplasmic space. This method has also been used successfully for fragments of 
angiostatin (Kjringle(K)l, K2, K3, and K2-3, (Cao et al., (1996)). Alternatively, angiostatin and angiostatin 
cysteine muteins can be e:q)ressed cytoplasmically in E. coli. or other host cell. Angiostatin has 26 

25 cysteines diat form 13 disulfides. Therefore, conventional refold protocols without an added cysteine 
blocking would likely be unsuccessful with a cysteine rich protein like angiostatin. Preferred sites for 
introducing cysteine residues into angiostatin include JC97C ( a cysteine residue added onto the N-terminus 
of mature angiostatin), T365C, 371C, S460C, A463C, and *466C (a cysteine residue added onto the C- 
terminus of the mature angiostatin protein. 

30 Bacterial cells expressing recombinant angiostatin or the angiostatin cysteine muteins can be lysed 

using B-per as described by the manufacturer's protocol (Pierce). The insoluble portion can be isolated by 
centrifugation. The pellet can be solublized using a mixture of 20 mM cysteine, 6 M guanidine, 20 mM 
Tpia base. The mixture can be stirred for 2 hours at room temperature before being diluted 10 fold into 20 
mM Tris, The refold can be held at 4'*C for 1-2 days. At the end of this time, the refold can be centrifuged 

35 and the angiostatin protein (or cysteine muyteins) can be purified by using a lysine-sepharose column The 
refold mixture can be loaded directly onto the column which is previously equilibrated in 20 mM Hcpes, 
0,15 M NaCl, pH 7.4 Angiostatin (or an angiostatin cystine mutein) can be released firom the r«sin using a 
gradient of 0-12 mM E-aminocaprioic acid. Further purification, if necessary, can be accon:q>lisbed using 
v^ous ion exchange or HIC lesins. 
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Example 24 
Peptide mapping of PEGylated proteins 
In many instances, peptide maps can be used to verify the site of PEGylation, Typically the 
PEGylated protein is specifically digested such that the cysteine mutein is present in a pq;)tide with no other 

5 cysteine residues. The presense of a PEG covalently attached to the peptide will dramatically change the 
retention time when the digestion mix is assayed by Reversed Phase HPLC. When GH is digested with 
trypsin using conditions from the literature (Clark et al., 1996), 21 possible tryptic peptides (T1-T21, 
numbered consecutively) can be isolated. Tl, representing residues 1-8 which includes the mutation T3C, 
shifts to a slightly earlier retention time for the cysteine mutant (61 minutes) versus wild type ( 64 minutes) 

10 or pituitary growth hormone. When PEGylated with a 5 K PEG, the Tl peptide moves to the end of the 
chromatogram with a retention time greater than 100 minutes. When GH is digested with endoprotease Lys- 
C, 10 peptides (Ll-10, numbered consecutively) LI representing residues 1-38 elutes at around 59 minutes 
for wild type GH and around 61 minutes for the mutein T3C. When PEGylated with a 20 K PEG, LI is 
missing from the chromatogram. These data confirm that indeed the PEG moiety is attached to the cysteine 

15 residue at posfipn 3 as predicted rather than at a native cysteuie. Enzymatic digestion and RP HPLC 
analysis ofcysteinemutiensofIFN(ti7psm and endoprotease Glu-C),GM-CSF (endoprotease Glu-C), and 
G-CSF (endoprotease Lys-C) before and after PEGylation also showed data that was consistent with a 
single site of PEGylation at the newly introduced cysteme residue. 

20 Example 25 

Peripheral Blood Progenitor Cell Mobilization Initiated by PEG-G-CSF and 
PEG-GM-CSF Cysteine Muteins 
Treatment with recombinant G-CSF and recombinant GM-CSF has been shown to mobilize 
peripheral blood progenitor cells (PBPC) that give rise to more rapid production and engraftment of 

25 neutrophils and platelets following chemotherapay. The enhancement of PBPC mobilization (and potentially 
engraftment rates) can be evaluated m the presence of the PEGylated G-CSF and PEGylated GM-CSF 
cysteine muteins. Spleenectomized mice strains known to have well defmed marrow cell profiles and 
proliferation kinectics can be given a single or daily (up to 7 days) intravenous or subcutaneous dose(s) of 
G-CSF (wild-type or Neupogen®) or PEGylated G-CSF cysteine muteins. Each experiment can also 

30 contain a group of mice treated only with a carrier, consisting of mouse serum albumin suspended m 
isotonic saline. Following treatment, peripheral blood can be harvested by cardiac puncture and collected in 
EDTA-containing tubes. CBC analysis can be performed. Bone marrow cells can be harvested by flushing 
the contents of the femur and marrow. White cell count numbers can be determined by staining with crystal 
violet and hemacytometer enumeration. Low density cells can be isolated using blood density gradient 

35 fractionation and used in progenitor cell assays. The protocol for the progenitor cell assays is outlined in 
Briddell, et al (1993). Basically, a double-layer agar based system (Bradley et al, 1978) can be used to 
evaluate both primitive (high proliferative potential-^colony-forraing cells) and mature (granulocyte- 
macrophage colony forming cells) progenitor cells. A mettylcellulose-based assay system developed by 
Iscovo et al (1974 ) can be used to evaluate erythroid colony formation. PEGylated G-CSF cysteine 
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muteins will increase mobilization of progenitor and stem cells. Similar studies can be performed with 
PEGylated GM-CSF cysteine muteins and wild type GM-CSF, Ultimately, the eflBciency of transplantation 
in lethally irradiated miceand the ability to expedite the engraftment process in the presence of PEGylated 
G-CSF and PEGylated GM-CSF cysteine muteins can be investigated. 
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What is claimed is : 

1. A method for preparing a refolded, soluble form of an insoluble or aggregated protein that is a member 
of the Growth Hormone supergene family and which contains one or more free cysteine residues, 
comprising the steps of: 

a. causing a host cell to e;q)ress a protein that is a member of the growth hormone supergene 
family in an insoluble or aggregated form; 

b. lysing the cells by chemical, enzymatic or physical means; 

solubilizing the insoluble or aggregated protein by exposing the insoluble or aggregated 
protein to a denaturing agent, a reducing agent and a cysteine blocking agent; and 
d. refolding the protein by reducing the concentrations of the denaturing agent and reducing 
agents to levels su£Eicient to allow the protein to renature into a soluble, biologically 
active form. 

2. The method of claim 1, wherein said member of the growth hormone supergene family is secreted by 
the host cell. 

3. The method of claun 1, wherein the member of the growth hormone supergene family is expressed by 
the host cell as an intracellular protem. 

4. The method of claim 1, wherein said step (b) of lysing comprises lysing the host cell in the presence of 
a cysteme blocking agent. 

5. The method of claim 1 , wherein said step (b) of lysing comprises lysing the host cell in the presence of 
a denaturing agent. 

6. The method of claim 1, wherein said step (b) of lysmg comprises lysing the host cell in the presence of 
a denaturing agent and a reducing agent. 

7. The method of claim 1, wherein said step (b) of lysing comprises: 

(1) lysing the host cell 

(2) separating soluble proteins from insoluble or aggregated proteins. 

8. The method of claim 1, wherein said cysteine blocking agent is selected from the group consisting of 
cysteine, cysteamine, reduced glutathione or tiiioglycolic acid. 

9. The method of claim 1, wherein said cysteine blocking agent is cysteine. 

10. The method of claun 1, wberem said reducing agent and said cysteine blocking agent of said step (c) 
ar« the same compound. 
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1 1 . The method of claim 10, wherem said cysteine blocking agent is selected from the group consisting of 
cysteine, cysteamine, reduced glutathione or thioglycolic acid. 

5 12. The method of claim 1, wherein said cysteine blocking agent of step (c) is a dithiol that, when reduced, 
acts as a cysteine blocking agent. 

13. The method of claim 12, wherem said dithiol is selected from die group consisting of cystine, 
cystamine, oxidized glutathione, or dithioglycolic acid. 

10 

14. The method of claim I, wherein the reducing agent is dithiothreitol (DTT) or 2-mercaptoethanoL 

15. The method of claim 1, wherein said step (d) of refolding comprises refolding the protein in the 
presence of glycerol. 

15 

16. The method of claim 1, wherein said step (d) of refolding comprises refolding the protem in the 
presence of an oxidizing agent selected from the group consisting of oxygen, a dithiol, iodine, hydrogen 
peroxide, dihydroascorbic acid, tetrathionate, or 0-iodosobenzoate. 

20 17. The method of claim 1, wherein step (d) of refolding comprises refolding the protein in the presence of 
a metal ion. 

18. The method ofclaun 17, wherein said metal ion is Cu^ or Co"^. 

25 19. The method of claim 1, wherein said step (d) of refolding comprises refoldmg the protem in the 
presence of a cysteine blocking agent 

20. The method of claim 1, wherein said step (d) of refolding comprises refolding the protein in the 
presence of a denaturing agent. 

30 

21. The method of claim 1, wherein said step <d) of refolding comprises refolding the protein in the 
presence of a dithiol. 

22. The method of claim 21, wherein said dithiol is selected from the group consisting of cystine, 
35 cystamine, dithioglycolic acid, or oxidized glutathionine. 

23. The method of claim 1, wherein said step (d) of refolding occurs in the presence of a reducirig agent. 



SUBSTITUTE SHEET (RULE 26) 



wo 01/87925 



83 



PCTAJSOl/16088 



24. The metbod of claim 23, wherem said reducing agent is selected from the groiQ> consisting of cysteine, 
DTT, 2-mercaptoethanol, reduced glutathione, cysteine, cysteamme, thioglycolic acid, or other thiol. 

25. The method of claim 1, wherein said insoluble or aggregated protein is a recombinant protein. 

26. The method of claim 1, wherein said insoluble or aggregated protein is a cysteine variant of a member 
of the growth hormone supergene family, or a derivative or an antagonist thereof 

27. The method of claim 1, wherein the member of the Growth Hormone supergene family is selected from 
the group consisting of growth hormone, prolactin, placental lactogen, erythropoietin, thrombopoietin, 
interleukin-2, interleukin-3, interleukin-4, interleukin-5, interleukin-6, interleukin-7, interleukin-9, 
interleukin-10, interleukin-11, interleuldn-12 (p35 subunit), interleukin-13, interleukin-15, IL-19, IL- 
20, oncostatin M, ciliary neurotrophic factor, leukemia inhibitory factor, alpha interferon, beta 
interferon, gamma interferon, omega interferon, tau interferon, granuloc>te-colony stimulating factor, 
granulocyte-macrophage colony stimulating factor, cardiotrophin-1, macrophage colony stimulating 
factor, stem cell &ctor and flt-31igand. 

28. The method of claun 1, frirther con^rising attaching a cysteine-reactive moiety to said isolated protein 
to foim a cysteine modified protein. 

29. The method of claim 28, wherein the cysteine-reactive moiety is selected from the group consisting of a 
polyethylene glycol, a polyvinyl pyrolidone, a carbohydrate or a dextran. 

30. The method of claim 1, further comprising attaching a cysteine-reactive polyethylene glycol moiety to a 
cysteine residue in said isolated protein to form a pegylated protein. 

31. The method of Claim 1, further comprising the step of: 

(e) isolating the refolded, soluble protein from other proteins in the refold mixture of step (d). 

32. A method for covalently modifying said isolated, refolded, soluble protein produced according to claim 
1, further comprising the steps of; 

(f) exposing the isolated protein to a disulfide-reducing agent; and 

(g) exposing the protein to a cysteine-reactive moiety to obtain a cysteine- modified 
protein. 

33. The method of claun 32, wherein said cysteine-reactive moiety is selected from the group consisting of 
a polyethylene glycol, a polyvinyl pyrOlidone, a dex^an or a carbohydrate. 
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34. The method of claim 32, fiirther comprising isolating the cysteine-modified protein from the 
unmodified protem. 

35. The method of claim 1, wherein said member of the growth hormone s\q)ergene fimily is growth 
5 hormone. 

36. The method of claim 1, wherein said member of the growth hormone si^ergene family is a cysteine 
variant of growth hormone. 

10 37. The method of claim 1, wherein said member of the growth hormone supergene family is alpha 
interferon. 

38. The method of claim 1, wherein said member of the growth hormone supergene family is a cysteine 
variant of alpha interferon. 

15 

39. The method of any one of claims 37 or 38, viierein the alpha interferon protein is alpha interferon a2. 

40. The method of claim 1, wherein said member of the growth hormone si^rgene family is granulocyte- 
macrophage colony stimulating factor (GM-CSF). 

20 

41. The method of claun 1, wherein said member of the growth hormone supergene family is a cysteine 
variant of GM-CSR 

42. The method of claim 1, wherein said member of the growth hormone supergene fiimily is granulocyte 
25 colony stunulating factor (G-CSF). 

43. The method of claim 1, wherein said member of the growth hormone supei^ene family is a cysteine 
variant of G-CSF. 

30 44. The method of claims 43, wherein said G-CSF cysteine variant contains a non-cysteme amino acid 
substituted for Cysteine- 17. 

45. The method of claim 44, wherein the amino acid substituted for oysteine-17 in said G-CSF cysteine 
variant is serine or alanine. 

35 

46. The method of claun 1, wherem said member of the growth hormone supergene &mily is 
erythropoietin. 
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47. The method of claim 1, wherein said member of the growth hormone supergene family is a cysteine 
variant of erythropoietin. 

48. A multimeric protein produced according to claim 1, comprising at least two proteins each having a free 
cysteine and wherein at least two of said proteins are attached to each other through said free cysteines. 

49. A method for preparing a refolded, soluble form of an insoluble or aggregated protein that is an anti- 
angiogenesis factor and which contains one or more free cysteine residues, conq^rising the steps of: 

a. causing a host cell to express a protein that is an anti-angiogenesis factor in an insoluble or 
aggregated form; 

b. lysing the cells by chemical, enzymatic or physical means ; 

c. solubilizing the insoluble or aggregated protein by exposing the insoluble or aggregated 
protein to a denaturing agent, a reducing agent and a cysteine blocking agent; and 

d. refolding the protein by reducing the concentrations of the denaturing agent and reducing 
agents to levels suf&cient to allow the protein to renature mto a soluble^ biologically active 
form. 

50. The method of Claim 49, frirther comprising the step of: 

e. isolatmg the refolded, soluble protein from other protems in the refold mixture 
of step (d). 

50. The method of claim 49, wherein said anti-angiogenesis factor is endostatin. 

51. The method of claim 49, wherein said anti-angiogenesis fector is a cysteine variant of endostatin. 

52. The method of claim 49, wherein said anti-angiogenesis &ctor is angiostatin. 

53. The method of claim 49, wherein said anti-angiogenesis factor is a cysteine variant of angiostatin. 

54. The method of claim 49 wherein said reducing agent and said cysteine blocking agent of said step (c) 
are the same compound. 
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SEQUENCE LISTING 

<110> Rosendahl, Mary 
Cox, George 
Doherty, Daniel 

<120> Methods for Refolding Proteins Containing Free Cysteine Residues 

<130> 4152-4-PCT 

<150> 60/204,617 
<151> 2000-05-16 

<160> 79 

<170> Patentin version 3.0 

<210> 1 

<2ai> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 
<400> 1 ' 

cgcaagcttg ccaccatggc tggacctgcc acccag 36 

<210> 2 

<211> 36 

<212> DNA 

<213> Artificial 



<220> 
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<223> primer 
<400> 2 

cgcggatcct ccggagggct gggcaaggtg gcgtag 36 



<210> 3 

<211> 66 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 3 

ggcccggcca gctccctgcc gcagagcttc ctgctgaaga gcctcgagca agtgcgtaag 60 
atccag 66 

<210> 4 

<211> 30 

<212> DNA 

<213> Artificial 



<220> 

<223> primer 

<400> 4 

cgcgaattct tagggctggg caaggtggcg 30 

<210> 5 

<211> 66 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 



<400> 5 

ggcccggcca gctccctgcc gcagagcttc ctgcttaagt gcctcgagca agtgcgtaag 60 
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atccag 66 

<210> 6 

<211> 63 

<212> DNA 

<213> Artificial 



<220> 

<223> primer 

<400> 6 

atgttcgttt tctctatcgc taccaacgcg tacgcaaccc cgctgggccc ggccagctcc 60 
ctg 63 

<210> 7 

<211> 66 

<212> DNA 

<213> Artificial 



<220> 

<223> primer 
<400> 7 

ccccctctag acatatgaag aagaacatcg cattcctgct ggcatctatg ttcgttttct 60 
c tat eg 66 

<210> 8 

<211> 29 

<212> DNA 

<213> Artificial 



<220> 

<223> primer 
<400> 8 

cgccatatga ccccgctggg cccggccag 29 
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<210> 9 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> ,9 

accaacgcgt acgcaacccc gtgtggcccg gccagc 

<210> 10 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 10 

gccatcgccc tggatcttac g 

<210> 11 

<211> 36 

.<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 11 

accaacgcgt acgcatgccc gctgggcccg gccagc 

<210> 12 

<211> 30 

<212> DNA 

<213> Artificial 
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<220> 

<223> primer 

<400> 12 

cgcgaattct tagggacagg caaggtggcg 

<210> 13 

<211> 21 

<212> DMA 

<213> Artificial 

<220> 

<223> primer 

<400> 13 

gccatcgccc tggatcttac g 

<210> 14 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 14 

cgcgaattct taacagggct gggcaaggtg gcgtag 

<210> 15 

<211> 27 

<212> DNA 

<213> Artificial 
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30 



21 



<220> 

<223> primer 
<400> 15 
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ccgctgggcc cgtgcagctc cctgccg 27 

<210> 16 

<211> 27 

<212> DNA 

<213> Artificial 



<220> 

<223> primer 

<400> 16 

cggcagggag ctgcacgggc ccagcgg 27 

<2.10> 17 

<211> 22 

<212> DNA 

<213> Artificial 



<220> 

<223> primer 

<400> 17 

ctatgcggca tcagagcaga ta 22 

<210> 18 

<211> 27 

<212> DNA 

<213> Artificial 



<220> 

<223> primer 
<400> 18 

ctgggcccgg cctgctccct gccgcag 27 

<210> 19 
<211> 27 



wo 01/87925 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 19 

ctgcggcagg gagcaggccg ggcccag 

<210> .20 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 20 

aacccgtacg catgtacccc gctgggc 

<210> 21 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 21 

gcccagcggg gtacatgcgt acgcgtt 

<210> 22 

<211> 22 

<212> DNA 

<213> Artificial 
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27 



27 



<220> 
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<223> primer 
<400> 22 

tgtggaattg tgagcggata ac 

<210> 23 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 
<400> 23 

ggaatggccc cttgcctgca gcccacc 

<210> 24 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

c223> primer 
<400> 24 

ggtgggctgc aggcaagggg ccattcc 

<210> 25 

<;211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> primer . 

<400> 25 

gccctgcagc cctgccaggg tgccatg 
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22 



27 



27 



<210> 26 
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<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 26 

catggcaccc tggcagggct gcagggc 27 

<210> 27 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 27 

ggtgccatgc cgtgcttcgc ctctgct 27 



<210> 



28 



<211> 



27 



<212> 



DNA 



<213> 



Artificial 



<220> 



<223> 



primer 



<400> 28 

agcagaggcg aagcacggca tggcacc 



27 



<210> 29 



<211> 27 



<212> DNA 



<213> Artificial 
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<220> 

<223> primer 
<400> 29 

ccggccttcg cctgtgcttt ccagcgc 27 

<210> 30 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 
<400> 30 

gcgctggaaa gcacaggcga aggccgg 27 

<210>. 31 

<211> 27 

<212> DNA 

<213> Artificial 



<220> 

<223> primer 

<400> 31 

cccacccagg gttgcatgcc ggccttc 27 

<210> 32 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 



<400> 32 

gaaggccggc atgcaaccct gggtggg 



27 
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<210> 33 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 33 

atgccggcct tctgctctgc tttccag 27 

<210> 34 

<211> 27 

<212> DNA 

<213> Artificial 



<220> 

<223> primer 

<400> 34 

ctggaaagca gagcagaagg ccggcat 27 

<210> 35 

<211> 21 

<212> DNA 

<213> Artificial 



<220> 

<223> primer 

<400> 35 

ggccattccc agttcttcca t 21 

<210> 36 

<211> 24 

<212> DNA 

<213> Artificial 
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<220> 



<223> primer 



<400> 36 

ttcgttttct ctatcgctac caac 



24 



<210> 37 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 37 

ctgcaggccc tgtgtgggat ctccccc 27 

<210> 38 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> .38 

gggggagatc ccacacaggg cctgcag 27 

<210> 39 

<211> 27 

<212> DNA 

<213> Artificial 



<220> 



<223> 



primer 



<400> 



39 
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ctggaaggga tctgccccga gttgggt 27 

<210> 40 

<211> 27 

<212> DNA 

<213> Artificial 



<220> 

<223> primer 

<400> 40 

acccaactcg gggcagatcc cttccag 27 

<210> 41 

<211> 35 

<212> DNA 

<213> Artificial 



<220> 

<223> primer 

<400> 41 

cgcgctgcag ttctcatgtt tgacagctta tcatc 35 

<210> 42 

<211> 42 

<212> DNA 

<213> Artificial 



<220> 

<223> primer 
<400> 42 

cgcgctgcag atttaaatta gcgaggtgcc gccggcttcc at 42 

<210> 43 
<211> 38 
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<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 43 

gcgacgcgta cgcagcaccc acccgctcac ccatcact 

<210> 44 

<211> 42 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 44 

gcggaattct tatttttgga ctggtttttt gcattcaaag gg 

<210> 45 

<211> 38 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 45 

gcgacgcgta cgcagcaccc tgccgctcac ccatcact 

<210> 46 

<211> 32 

<212> DNA 

<213> Artificial 



<220> 
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<223> primer 
<400> 46 

ccaacgcgta cgcagcccca ccacgcctca tc 32 

<210> 47 

<211> 33 

<212> DNA 

<213> Artificial 



<220> 

<223> primer 

<400> 47 

ccggaattct taacggtcac ctgtgcggca ggc 33 

<210> 48 - . 

<211> 30 

<212> DNA 

<213> Artificial 



<220> 

<223> primer 

<400> 48 

ccggaattct tagtcacctg tgcggcaggc 30 

<210> 49 

<211> 57 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 
<400> 49 

ttcgctagca tgcatgacct gcaggaggaa atttaaatgg ccccaccacg cctcatc 57 



<210> 50 
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<211> 39 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 50 

gctaacgcgt acgcacacag ccaccgcgac ttccagccg * 39 

<210> 51 

<211> 38 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 
<400> 51 

cggaattcct cgagctactt ggaggcagtc atgaagct .38 

<210> 52 

<211> 90 

<212> DNA 

«:213> Artificial 

<220> 

<223> primer 
<400> 52 

gtgcaccata tgaagaagaa catcgcattc ctgctggcta gcatgcatga cctgcaggag 60 
gaaatttaaa tgcacagcca ccgcgacttc 90 

<210> 53 

<211> 37 

<212> DNA 

<213> Artificial 
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<220> 

<223> primer 

<400> 53 

gaggaaattt aaatgtgcca cagccatcgc gacttcc 37 

<210> 54 

<211> 22 

<212> DNA 

<213> Artificial 



<220> 

<223> primer 

<400> 54 

tgtggaattg tgagcggata ac 22 

<210> 55 

<211> 37 

<212> DNA 

<213> Artificial 



<220> 

<223> primer 

<400> 55 

ggaagtcgcg atggctgtgg cacatttaaa tttcctc 37 

<210> 56 

<211> 35 

<212> DNA 

<213> Artificial 



<220> 
<223> 
<400> 



primer 
56 
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gaggaaattt aaattgcagc catcgcgact tccag 35 

<:210> 57 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 57 

ctggaagtcg cgatggctgc acatttaaat ttcctc 36 

<210> 58 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 58 

atgcacagcc actgcgactt ccagccg 27 

<210> 59 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 59 

cggctggaag tcgcagtggc tgtgcat 27 

<210> 60 

<211> 29 



wo 01/87925 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 60 

gccaccgcga ctgtcaaccg gtgctccac 

<210> .61 

<211> 29 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 61 

gtggagcacc ggttgacagt cgcggtggc 

<210> 62 

<21i> 31 

<212> DiNA 

<213> Artificial 

<220> 

<223> primer 

<400> 62 

catgcggggc atctgcggcg ccgacttcca g 

<210> 63 

<2ia> 31 

<212> DNA 

<213> Artificial 
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<223> primer 
<400> 63 

ctggaagtcg gcgccgcaga tgccccgcat g .31 

<210> 64 

<211> 27 

<212> DNA 

<213> Artificial 



<220> 

<223> primer 

<400> 64 

ggctctgttc tcgtgctctg agggtcc 27 

<210> 65 

<211> 27 

<212> DNA 

<213> Artificial 



<220> 

<223> primer 

<400> 65 

ggaccctcag agcacgagaa cagagcc 27 

<210> 66 

<211> 27 

<212> DNA 

<213> Artificial 



<220> 

<223> primer 

<400> 66 

ccgctgaagc cctgcgcacg catcttc 27 



<210> 67 
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<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 67 

gaagatgcgt gcgcagggct tcagcgg 27 

<210> 68 

<211> 30 ' , 

<212> DNA 

<213> Artificial 



<220> 

<223> primer 

<400> 68 

gacgtcctga ggtgcccgac ctggccccag 30 

<210> 69 

<211> 33 

<212> DNA 

<213> Artificial 



<220> 

<223> primer 

<400> 69 . 

ggccaggcct ccagcctctg cgggggcagg etc 33 

<210> 70 

<211> 33 

<212> DNA 

<213> Artificial 
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<220> 

<223> primer 
<400> 70 

gagtictgccc ccgcagaggc tggaggcctg gcc 33 

<210> 71 

<211> 36 

<212> DNA 

<213> Artificial 



<220> 

<223> primer 

<400> 71 

ctgctggggg gctgcctcct gggccagagt gccgcg 36 

<210> 72 

<211> 36 

<212> DNA 

<213> Artificial 



<220> 

<223> primer 

<400> 72 

cgcggcactc tggcccagga ggcagccccc cagcag 36 

<210> 73 

<211> 31 ' 

<212> DNA 

<213> Artificial 



<220> 

<223> primer 
<400> 73 

ctcctggggc agtgcgcagc gagctgccat c 31 
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<210> 74 

<211> 31 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 74 

gatggcagct cgctgcgcac tgccccagga g 31 

<210> 75 

<211> 21 

<212> DNA 

<213> Artificial 



<220> 

<223> primer 

<:400> 75 

gacactgctg ctgagatgaa t 21 

<210> 76 

<211> 22 

<212> DNA 

<213> Artificial 



<220> 

<223> primer 

<400> 76 

cttgtagtgg ctggccatca tg 22 

<210> 77 

<211> 57 

<212> DNA 

<213> Artificial 
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<220> 

<223> primer 

<400> 77 

cgcaacgcgt acgcagcacc ggcccgctcg ccgagcccga gcacgcagcc gtgggag 57 

<210> 78 

<211> 57 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 78 

cgcgaattct tactcctgga ccggctccca gcagtcaaac gggatgacca gcagaaa 57 

<210> 79 

<211> 28 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 



<400> 79 

gttggtcaac tcgagccagc cgtgggag 



28 
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